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The  Kalman  filter  has  been  widely  accepted  to  be  useful  in  many 
areas  of  applications.  To  implement  the  Kalman  filter  for  a system  of 
interest,  covariances  of  noise  disturbing  the  system  must  be  completely 
specified.  In  most  practical  cases,  the  noise  covariances  are  either 
inexactly  known,  unknown,  or  subject  to  abrupt  changes.  In  this 
dissertation,  Kalman  filtering  under  the  above  uncertainties  in  noise 
covariances  is  analyzed.  The  dissertation  consists  of  three  main  parts. 
The  first  part  considers  situations  in  which  the  Kalman  filter  is 
designed  by  using  incorrect  noise  covariances.  Behavior  of  the  Kalman 
filter  under  incorrect  noise  covariances  is  then  analyzed  in  this  part. 
The  second  part  discusses  direct  estimation  of  the  noise  covariances 
when  they  are  unknown.  Finally,  cases  in  which  the  noise  covariances 
are  subject  to  abrupt  changes  are  investigated  in  the  third  part. 
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CHAPTER  I 
INTRODUCTION 


In  1960,  Kalman  published  his  celebrated  paper  on  a new  approach 
to  linear  filtering  and  prediction  problems  (Kalman,  1960).  In  his 
paper,  Kalman  formulated  and  solved  the  classical  Wiener  filtering 
problem  from  the  state-space  point  of  view.  The  solution  was  given  in 
terms  of  recursive  equations  which  are  known  today  as  the  discrete-time 
Kalman  filter.  The  continuous -time  version  of  the  filter  was  later 
given  by  Kalman  and  Bucy  (1961).  Since  then,  the  Kalman  filter  has  been 
extensively  studied  and  widely  used  in  many  areas  of  applications.  To 
implement  the  Kalman  filter  for  a system  of  interest,  both  statistical 
and  dynamical  model  parameters  of  the  system  must  be  completely 
specified.  The  exact  values  of  the  model  parameters,  however,  are 
hardly  known  in  most  circumstances.  Consequently,  considerable  research 
has  been  done  on  the  effect  of  errors  in  model  parameters. 

Heffes  (1966),  Nishimura  (1966,  1967a,  1967b,  1970),  and  Aasnaes 
and  Kailath  (1974)  studied  the  effect  of  errors  in  statistical  model 
parameters , while  the  effect  of  errors  in  dynamical  model  parameters  was 
studied  by  Neal  (1967),  Koussoulas  and  Leondes  (1986).  Fagin  (1964), 
Huddle  and  Wismer  (1968),  Griffin  and  Sage  (1968,  1969),  Fujita  and 
Fukao  (1970),  Lainiotis  and  Sims  (1970),  and  Brown  and  Sage  (1971a, 
1971b)  investigated  the  effect  of  errors  in  both  statistical  and 
dynamical  model  parameters.  A class  of  error  phenomena  known  as 
divergence  was  examined  by  Schlee  et  al.  (1967),  Price  (1968),  and 
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Fitzgerald  (1971) . Bounds  on  the  filter  performance  in  the  presence  of 
modeling  errors  were  derived  in  Toda  and  Patel  (1978,  1980)  and  Patel 
and  Toda  (1984).  In  spite  of  considerable  research  mentioned  above,  few 
useful  qualitative  results  have  been  obtained.  In  particular,  no 
qualitative  results  were  given  at  all  in  Fagin  (1964),  Heffes  (1966), 
Neal  (1967),  Griffin  and  Sage  (1968,  1969),  Lainiotis  and  Sims  (1970), 
and  Koussoulas  and  Leondes  (1986).  They  merely  developed  equations  for 
evaluating  the  actual  mean- squared- error  performance  of  the  Kalman 
filter  in  terms  of  parameter  errors. 

Inaccuracies  in  the  initial  state  error  covariance  and  noise 
covariances  are  quite  common.  The  initial  state  error  covariance,  for 
example,  is  often  poorly  known.  The  selection  of  the  initial  state 
error  covariance  is  usually  based  on  physical  intuition  and  common 
sense.  However,  under  certain  conditions  (Jazwinski,  1970;  Aasnaes  and 
Kailath , 1974)  the  effect  of  errors  in  the  initial  state  error 
covariance  is  lessened  as  more  and  more  measurements  are  processed.  The 
noise  covariances,  on  the  other  hand,  are  either  inexactly  known, 
unknown,  or  subject  to  abrupt  changes  in  most  practical  cases. 

In  this  dissertation,  Kalman  filtering  under  the  above 
uncertainties  in  noise  covariances  is  analyzed.  Particular  emphasis  is 
given  to  discrete- time  Kalman  filtering.  The  dissertation  comprises 
three  main  parts.  In  the  first  part,  we  consider  situations  in  which 
the  Kalman  filter  is  designed  by  using  incorrect  noise  covariances. 
Behavior  of  the  Kalman  filter  under  incorrect  noise  covariances  is  then 
analyzed.  The  results  developed  in  the  first  part  provide  useful 
insights  in  behavior  of  the  Kalman  filter  when  inexact  values  of  noise 
covariances  are  used.  Situations  in  which  the  noise  covariances  are 
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unknown  are  considered  in  the  second  part.  A direct  technique  for 
estimating  the  unknown  noise  covariances,  which  is  referred  to  as  the 
stationary  preprocessed  measurement  correlation  (SPMC)  technique,  is 
discussed  and  analyzed  in  this  part.  In  the  third  part,  we  consider 
cases  in  which  the  noise  covariances  are  subject  to  abrupt  changes.  In 
Par^::*-Cu^ar  > we  are  interested  in  abrupt  increases  in  measurement  noise 


covariances  which  can  be  used  to  model  the  effect  of  sensor  failures  in 
the  form  of  increased  inaccuracies.  A scheme  for  detecting  abrupt 


increases  in  measurement  noise  covariances  is  developed. 

The  organization  of  the  dissertation  is  as  follows.  Chapter  II 
contains  analysis  of  discrete- time  Kalman  filtering  under  incorrect 
noise  covariances.  Similar  analysis  for  continuous  - time  Kalman 
filtering  is  given  in  Chapter  III.  Discussion  and  analysis  of  direct 
estimation  of  noise  covariances  when  they  are  unknown  are  given  in 
Chapter  IV.  In  Chapter  V,  cases  in  which  noise  covariances  are  subject 
to  abrupt  changes  are  considered,  and  a scheme  for  detecting  abrupt 
increases  in  noise  covariances  is  formulated.  Conclusions  of  results 
presented  in  the  dissertation  are  given  in  Chapter  VI.  The  dissertation 
also  contains  four  appendices  which  provide  the  relevant  support 


material . 


CHAPTER  II 

BEHAVIOR  OF  THE  DISCRETE -TIME  KALMAN  FILTER 
UNDER  INCORRECT  NOISE  COVARIANCES 

In  most  practical  cases,  noise  covariances  are  not  completely 
known.  Thus,  the  values  of  the  noise  covariances  used  in  designing  the 
Kalman  filter  are  usually  the  approximated  values  of  the  actual  noise 
covariances.  It  has  been  shown  by  Fitzgerald  (1971)  that  incorrect 
values  of  the  noise  covariances  can  cause  the  filter  to  diverge. 
Because  of  common  errors  in  noise  covariances  and  possible  filter 
divergence,  it  is  important  to  investigate  and  understand  behavior  of 
the  Kalman  filter  under  incorrect  noise  covariances. 

In  this  chapter,  we  analyze  behavior  of  the  discrete -time  Kalman 
under  incorrect  noise  covariances.  In  particular,  we  are 
interested  in  the  characteristic  of  the  actual  performance  of  the  Kalman 
filter.  The  quantity  used  to  represent  the  filter  performance  is  the 
actual  one-step  predictor  error  covariance.  For  simplicity  of 
presentation,  the  initial  state  error  covariance  is  assumed  to  be 
exactly  known  throughout  the  chapter.  However,  it  should  be  noted  that 
only  minor  and  rather  straightforward  modifications  of  the  results 
presented  here  are  required  in  case  that  the  initial  state  error 
covariance  is  incorrect  as  well. 

The  investigation  given  here  is  restricted  to  linear  time- 
invariant  systems  with  stationary  noise  processes.  But  for  completeness 
of  presentation,  relevant  results  for  linear  time -varying  systems  with 
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nonstationary  noise  processes  will  be  mentioned.  Portions  of  the 
results  contained  in  this  chapter  have  been  reported  in  Sangsuk-Iam  and 
Bullock  (1987a). 

2 . 1 Preliminaries 

Consider  the  discrete  stochastic  dynamical  system  described  by 

*k+l  = Fkxk+Wk  (2.1) 

yk  = + \ (2-2) 

where  x^  and  denote  the  state  and  the  measurement,  respectively,  w^ 

and  vk  denote  respectively  the  system  noise  and  the  measurement  noise, 

assumed  white  and  zero-mean.  Furthermore,  x , {w  } , and  (v  ) are 

ok  k 

assumed  to  be  mutually  uncorrelated. 

Given  {yQ,  ) . the  linear  minimum  variance  estimate  of  x^ 

can  be  computed  recursively  by  the  standard  Kalman  filter.  Let  us  now 
consider  the  case  that  the  filter  has  been  designed  with  system  and 
measurement  noise  covariances  > 0 and  R^  > 0,  respectively,  but  yet 
the  actual  noise  covariances  are,  respectively,  Q°  and  R^.  It  should  be 
noted  that  the  estimate  ^ of  given  {yQ,  y^..,  y^},  computed 

from  the  filter,  is  no  longer  the  linear  minimum  variance  estimate. 

A 

However,  x^^  ^ is  still  an  unbiased  estimate  of  x^  . 

Let  denote  the  one -step  predictor  error  covariance  computed 
from  the  filter.  That  is  satisfies  the  following  Riccati  difference 
equation  (RDE) : 

"k+i  - Wk  - rkW  (Wi  + \)'1  Wk  + Qk  : M0  - n <2-3> 

where  the  prime  symbol  denotes  matrix  transposition,  and  n denotes  the 
covariance  of  x 
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Or  equivalently, 


ti  - Wi  + KkRkKi  + \ 


(2.4) 


“here  ■'k  WiWk  + \)‘1  “d  Fk  =-  Fk-  Vk  ■ 

Let  us  define  the  actual  one -step  predictor  error  covariance  m£  as 

the  error  covariance  associated  with  x,  , i e 

K|k-1  ’ ' ' ’ 

< C0V(3tk|k-i,  \|k.i>  ”here  *k|k-i  *k  • \|k-i  • 

It  is  straightforward  to  show  that  (Heffes,  1966) 

•Cl  - + V&  + Qk  : M°  - n <2-5) 

The  following  simple  but  useful  result,  which  follows  immediately 
from  (2.4)  and  (2.5),  is  due  to  Nishimura  (1970). 

Theorem  2.1:  If  > (<)  Q°  and  ^ > (<)  for  all  k > 0, 

then  > (<)  m£  for  all  k > 0. 

The  above  theorem  demonstrates  the  effects  of  pessimistic  and 
optimistic  designs.  If,  for  example,  the  upper  bounds  of  noise 

covariances  are  known  instead  of  their  actual  values,  the  filter  may 
then  be  designed  by  setting  the  noise  covariances  at  their  upper  bound. 
This  is  known  as  a pessimistic  design  which  gives  by  the  above 

theorem. 

Let  $(k,i)  be  the  state  transition  matrix  associated  with  F 

k’ 


i.e.  , 


M° 

“k 


*(k+l,i)  - Fk  *(k,i)  ; *(i,i)  - I V k>i>0 

From  (2.5),  can  then  be  written  as 

k - 1 

= tf(k,0)ITr  (k,0)  + 2 tf(k,i+l) [K.R?k:  + Q?]*' (k, i+1) 

i = 0 111  1 x ' 


(2.6) 


(2.7) 
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It  can  be  observed  from  (2.7)  that  m£  is  uniformly  bounded  (from 
above),  i.e.,  sug>  ||M^||  < « , if  is  exponentially  stable  and 

[KkROlCk  + Qk]  is  uniformly  bounded.  Thus,  the  following  theorem  which 
was  originally  proven  by  Price  (1968)  is  immediate. 

Theorem  2.2:  Let  Ffc,  Hk>  Qk>  R^1,  Q°,  and  r£  be  uniformly 
bounded.  If  is  uniformly  bounded  and  F^  is  exponentially  stable, 
then  is  uniformly  bounded. 


Remark — 2.1:  For  uniformly  bounded  Fk>  Hk>  Qk>  and  R^1, 

Anderson  and  Moore  (1981)  have  shown  that  is  uniformly  bounded  if 

(Fk’  \ is  uniformly  detectable,  where  = R^(R^) ' . If,  in 

addition,  (F^,  Q,  **)  is  uniformly  s tab  i 1 iz  able , then  F is 

K k 

exponentially  stable. 


Remark  2.2: 
uniform  boundedness 
be  useful.  That  is 
uniformly  bounded. 
(Fitzgerald,  1971). 


For  certain  high-precision  control  systems,  the 
°f  , itself,  may  not  be  sufficient  for  x^k  ^ to 
Mk  may  become  intolerably  large  even  though  it  is 
This  phenomenon  is  known  as  apparent  divergence 


For  constant  F,  H,  Q,  and  R,  it  is  well  known  that  under  certain 

conditions  (Caines  and  Mayne,  1970  & 1971;  Anderson  and  Moore,  1979; 

Chan  et  al.  , 1984;  De  Souza  et  al. , 1986)  Mk  converges  to  M which  is  a 

solution  to  the  following  algebraic  Riccati  equation  (ARE) : 

M = FMF'  - FMH'(HMH'  + R)—1HMF'  + Q (2.8) 

Let  K .=  FMH' (HMH'  + R)  and  F :=  F — KH  . Then  M is  called  the 
stabilizing  solution  of  the  ARE  if  F has  all  its  eigenvalues  inside  the 


unit  circle. 
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Remark  2.3:  A necessary  and  sufficient  condition  for  existence  of 
the  stabilizing  solution  is  that  (F,H)  is  detectable  and  (F.Q1*)  has  no 
unreachable  mode  on  the  unit  circle  (Kucera,  1972b;  Molinari,  1975; 
De  Souza  et  al. , 1986).  It  should  also  be  noted  that  if  the  stabilizing 
solution  of  the  ARE  exists,  then  it  is  necessarily  unique. 

To  the  author's  knowledge,  the  convergence  of  m£,  however,  has  not 
been  rigorously  established  in  any  literature.  But,  it  is  often  used  as 
a part  of  assumptions  in  some  of  literature;  see,  for  example,  Mehra 
(1970)  and  Toda  and  Patel  (1980).  It  is  clear  from  (2.5)  with  constant 
F,  H,  Q , and  R that  if  converges  to  K and  converges  to  M°,  then 
M must  satisfy  the  following  algebraic  Lyapunov  equation  (ALE) : 

M°  = FM°F'  + KR°K'  + Q°  (2.9) 

In  the  sequel,  linear  time- invariant  systems  with  stationary  noise 
processes  are  considered.  Two  important  properties  of  the  actual  one- 
step  predictor  error  covariance,  convergence  and  divergence,  will  be 
studied  in  Sections  2.2  and  2.3,  respectively. 

2 . 2 Convergence  Analysis 

In  this  section,  we  shall  establish  a sufficient  condition  for 
convergence  of  M^  . Cases  in  which  the  noise  covariances  are  known  up 
to  an  unknown  scale  factor  are  also  considered.  For  these  special 
cases,  it  will  be  shown,  in  particular,  that  under  certain  conditions, 
the  Kalman  filter  is  asymptotically  optimal  despite  incorrect  values  of 
noise  covariances.  Consequently,  the  residual  sequence  is 
asymptotically  white  under  those  conditions.  This  reveals  the 
insufficiency  of  the  whiteness  test  on  the  residual  sequence  used  in 
Mehra  (1970).  Bounds  on  the  asymptotic  filter  performance  are  derived 
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when  the  range  of  errors  in  noise  covariances  is  known. 

Let  us  first  prove  the  following  lemma  which  will  be  used  for  the 
proof  of  the  convergence  of  the  actual  one-step  predictor  error 
covariance . 

Lemma  2.1:  Let  be  a sequence  of  square  matrices  satisfying 

(1)  k>S  < 00  and 

\ = A whose  all  eigenvalues  lie  inside  the  unit  circle. 

Then  is  exponentially  stable,  i.e.,  ||$(k,i)||  < V-  k>i>0 

for  some  a > 0 and  /3  e (0,1),  where  $(k,i)  denotes  the  state 

transition  matrix  associated  with  A^. 

Proof.  Since  eigenvalues  of  are  continuous  functions  of  its 

entries  (Kato,  1976)  and  A^  converges  to  A whose  all  eigenvalues  lie 

inside  the  unit  circle,  there  exist  an  integer  nQ  and  p e (0,1)  such 

that  all  eigenvalues  of  A^  lie  inside  or  on  the  circle,  centered  at  the 

origin,  of  radius  p for  all  k > n , i e 

o ' ’ 

mfX  I Aj  I - P V k > no  (2.10) 

The  convergence  of  A^  also  implies  that  for  any  e > 0 , there 
exists  an  integer  n^  such  that 

???,  K+l  - s e (2.1D 

Based  on  the  exponential  stability  result  for  slowly  varying 
discrete  systems  given  in  Desoer  (1970)  (see  Theorem  A. 2 in  Appendix  A), 
condition  (i)  together  with  (2.10)  and  (2.11)  imply  that  there  exist 
integer  n2 , 7 > 0 , and  /3  e (0,1)  such  that 

ll$(k,i)||  < 7/3^k~  ^ V k>i>n2  (2.12) 
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It  then  follows  from  condition  (i)  and  (2.12)  that  there  exists  a 
positive  number  a such  that 

||$(k,i)||  < a0(k_i)  V k>i>0  . 

Remark  2.4:  Suppose  that  the  stabilizing  solution  M of  the  ARE 
exists  and  lim  — M . Then,  one  can  observe  that  :=  (F  - K^H) 

satisfies  both  conditions  stated  in  Lemma  2.1.  Consequently,  Fk  is 
exponentially  stable.  That  means  if  at  steady- state  conditions  the 
Kalman  filter  is  exponentially  stable  as  a time- invariant  filter,  then 
it  is  also  exponentially  stable  as  a time-varying  (but  asymptotically 
time -invariant)  filter.  A somewhat  weaker  statement  of  this  fact  was 
made  without  proof  by  Anderson  and  Moore  (1979,  p.82). 

The  conditions  for  m£  to  converge  can  be  established  as  follows. 

I^eorem — 2-3.:  Let  ( F , H)  be  detectable  and  (F.Q^)  have  no 

unreachable  mode  on  the  unit  circle.  Suppose  that  M^  converges  to  M 
which  is  the  stabilizing  solution  of  the  ARE.  Then  converges  to  M° 
which  is  the  unique  solution  of  the  ALE. 

Proof . First  notice  from  Remark  2.3  that  the  stabilizing  solution 
M of  the  ARE  exists  and  is  unique  since  (F,H)  is  detectable  and  (F,Q^) 
has  no  unreachable  mode  on  the  unit  circle.  Here,  M^  is  uniformly 
bounded  since  (F,H)  is  detectable.  Furthermore,  F is  exponentially 
stable  as  pointed  out  in  Remark  2.4.  It  then  follows  from  Theorem  2.2 
that  M^  is  uniformly  bounded.  Because  of  the  exponential  stability  of 
F , there  exists  a unique  solution  M°  to  the  ALE.  But, 

“k+l"  M°  “ ( Vk^k  + \R\  + Q°>  “ < ™°f ' + KR°K'  + Q°) 


(2.13) 
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Since  is  uniformly  bounded  and  converges  to  K,  may  be 

written  as  = K + where  uniformly  bounded  matrix  sequence 
converges  to  zero  as  k approaches  infinity.  Thus,  (2.13)  can  be 
rewritten  as 


Ci-  M° 


= ,,,o  ,-0.  — 


F(Mk  - M )F'  + 


(2.14) 


where  9. 


- KH,Ii  + v\  +™\ 

It  is  clear  that  9 is  uniformly  bounded  and  lim  9,  = 0 But 

K V — ►oo  lr 

from  (2.14),  we  have  that 

k 

o ' ' i-i  fci-l' 


'°  M° 


Mk- 


k-**>  k 


F k(M°-  M°)(F')k  + .2  F k'i04_1(F')  k_i  (2.15) 


Since  F is  exponentially  stable,  the  first  term  on  the  right  hand 
side  (RHS)  of  (2.15)  goes  to  zero  as  k approaches  infinity.  We  shall 
next  prove  that  the  second  term  on  RHS  of  (2.15)  also  goes  to  zero  as  k 
approaches  infinity. 

k , . # 

II i ?i  F 6 l-l  "ll  - i?!  Ck  for  some  a > 0 , £ e (0,1) 


2 a/3 

i = 1 r 


(k-i) 


i-11 


+ 2 ay3(k"l)||0.  | 

i =n+ 1 ^ 11  l-l1 


< n.sug  II^H  a^(k'n)  + 1S_i,a^(k_i)| 


i=n  + l 


i-11 


Given  e > 0 . Since  2 a/31  < ® and  lim  9.  = 0 , there  exist 

1 U j_  —tco  ]_ 

integers  N and  N with  N > N such  that  for  all  k > N 

o o ’ 


No  Il^ill  ^(k'No}  * e/2  and 

ii  k — k-i  _ k-i 

Hence,  2 F 9 . , (F') 

11  i = 1 l-l  ' 


(k-i) | 


i-11 


< e/2. 


< e for  all  k > N . 
Consequently,  the  second  term  on  RHS  of  (2.15)  converges  to  zero 
as  k approaches  infinity.  This  in  turn  implies  that  lim  m£ 


'°  = M° , 
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Remark — 2_J3:  In  case  that  (F,H)  is  detectable  and  (F.Q*4)  has  no 

unreachable  mode  on  the  unit  circle,  converges  to  the  unique 

stabilizing  solution  M of  the  ARE  if  one  of  the  following  conditions 
holds  (Caines  and  Mayne,  1970  & 1971;  Anderson  and  Moore,  1979;  Chan  et 
al.,  1984;  De  Souza  et  al. , 1986): 

(i)  (F , Q*5)  is  stabilizable  and  M > 0 

o 

(ii)  M > 0 or  M > M . 
o o 

Remark  2.6:  It  should  be  clear  from  the  proof  of  Theorem  2.3  that 

the  theorem  still  holds  even  though  an  incorrect  initial  state  error 

covariance  is  used,  i.e.,  M 4 n 

o 

Corollary  2.1:  Suppose  that  F has  all  its  eigenvalues  inside  the 
unit  circle.  Then  M^  always  converges  to  M°  which  is  the  unique 
solution  of  the  ALE. 


Proof:  It  is  clear  in  this  case  that  (F,H)  is  detectable  and 
( F , Q** ) is  stabilizable.  M^  therefore  converges  to  the  stabilizing 
solution  of  the  ARE.  The  convergence  of  to  the  unique  solution  of 
the  ALE  is  then  immediate  from  Theorem  2.3.  ■ 


Remark — 2.7:  If  the  filter  is  designed  with  no  system  noise,  i.e., 
Q = 0,  and  the  assumption  of  Corollary  2.1  holds,  then  the  computed 
error  covariance  converges  to  a zero  matrix.  Consequently,  m£ 
converges  to  M°  which  satisfies  M°  = F M°F'  + Q°.  For  this 
particular  case,  the  limit  of  is  therefore  independent  of  R°. 


Let  us  now  define 

y 


kik-i  : yk  ~ H *k|k-i 


(2.16) 


We  shall  refer  to  as  residual  sequence.  If  the  filter 

were  designed  by  using  the  correct  values  of  noise  covariances,  {y  . } 

K | K - 1 
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would  be  a zero-mean  white  noise  sequence  (Kailath,  1968).  Here,  the 
incorrect  values  of  noise  covariances  are  used,  y . are  therefore 

K K"  1 

correlated.  Nevertheless,  yk|kl  still  have  mean  zero. 

For  integer  h,  let  us  define 


C(k’h)  :=  C0V<yk|k-i>  yk-hlk-h-l)  : k^0  <2-17> 

It  is  straightforward  to  show  that 


C(k,h) 


HM^H'  + R°  ; h = 0 

■ H*(k,k-h+l)[  FM^_hH'  - Kk_h(HM^_hH'  + R°) ] ; k>h>0 


Remark — 2^8 : It  should  be  observed  that  if  Q = Q°  and  R = R°  , 
then  Kk_ji(HM°_kH'  + R ) = FM£_hH'  for  k>h>0.  In  such  a case,  it  can 
be  seen  from  (2.18)  that  ^yk|k-l^  is  w^^te  as  mentioned  earlier.  The 
sequence  {yk|k_-^)  . in  this  case,  is  commonly  referred  to  as  innovations 
sequence . 


The  following  corollary  which  describes  the  existence  and 
expression  of  the  limit  of  C(k,h)  is  immediate  from  Theorem  2.3  and 
(2.18). 

Corollary  2.2:  Subject  to  the  same  conditions  given  in  Theorem 

2.3, 


C 


h 


lim  C(k,h) 

k-+oo  v ’ ' 


' HM°H'  + R° 

* 

. HF  k'1  [ FM°H ' 


; h = 0 

K(HM°H'  + R°)]  ; h > 0 


(2.19) 


Remark  2.9:  In  case  that  F has  all  its  eigenvalues  inside  the  unit 
circle  and  Q = 0,  the  expression  of  can  be  simplified  as 

Ch  = HFhM°H'  + R°5h  ; h > 0 

where  M = FM  F'  + Q and  5^  denotes  the  Kronecker  delta,  i.e., 
^h  e1ua^  *:o  one  if  h = 0 and  equal  to  zero  otherwise. 
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One  can  observe  from  (2.9)  and  (2.19)  that  the  entries  of  C^,  Q°, 

and  R are  linearly  related.  If  are  given  and  the  number  of  linearly 

independent  equations  relating  the  entries  of  the  actual  noise 

covariances  and  the  entries  of  Ch  is  not  less  than  the  number  of  unknown 

entries  of  Q and  R , then  one  can  uniquely  determine  Q°  and  R°.  This 

basic  fact  has  been  used  in  Mehra  (1970)  and  Friedland  (1982)  to 

identify  the  actual  noise  covariances,  in  which  estimates  of  C,  are  used 

h 

instead  of  their  actual  value  in  solving  for  Q°  and  R°. 

In  some  circumstances,  the  noise  covariances  are  known  up  to  an 
unknown  scale  factor  (Iglehart  and  Leondes,  1974;  Burg  et  al. , 1982). 
That  is  the  actual  noise  covariances  are  of  the  form  Q°  = a.Q  and 
R = /3.R  where  a and  p are  unknown  positive  scalars.  For  this  special 
case,  the  following  result  can  be  established. 


Proposition  2.1: 

Subject  to  the  same  conditions 

given  in 

Theorem 

2.3, 

and  Q° 

= a.Q  , R° 

= j3.R  for  some  positive  scalars 

a and  p 

, then 

(i) 

M°  = 

J3.M  + (a  - p)  . Sq 

(2.20) 

P . (HMH'  + R)  + (a  - p) . HSqH' 

; h = o 

and 

(ii) 

Ch  " ‘ 

(2.21) 

l (a  - p) .HF  S H' 

q 

; h > 0 

where 

S is 

q 

the  unique 

solution  of  S = FS  F' 

q q 

+ Q 

(2.22) 

Proof : 

(i)  Since 

conditions  in  Theorem  2.3  are  satisfied, 

and 

M,  converge 

to  M and  Mq,  respectively,  and  F is  exponentially 

stable . 

From  (2.8)  and  (2.9)  together  with  the  exponential  stability  of  F,  we 
have  that 


00  _ i _ i 00  i i 

M = i^o  F ^KRK'  + Q)(F')  and  M°  = ^ F (/3.KRK'  + a.Q)(F'). 
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Thus , M 


_ i _ i 

/3.M.  + (a  - 0)  S F Q(F') 


(2.23) 

_ 00  _ i _ i 

Because  of  the  exponential  stability  of  F,  S :=  S F Q(F' ) is 

q i = o 

the  unique  solution  of  (2.22).  Hence,  M°  = 0.M  + (a  - £).S 

(ii)  Substituting  M°  = p.ft  + (a  - P).S  into  (2.19)  , one 

obtains  (2.21)  after  some  algebraic  manipulations.  ■ 

An  interesting  special  case  arises  when  a = /3  . For  this 

particular  case,  we  obtain  the  following  result. 

Corollary  2.3:  Subject  to  the  same  conditions  given  in  Proposition 
2 . 1 and  a = f)  , then 

(i)  The  one- step  predictor  estimate  obtained  from  the  Kalman 

filter  using  noise  covariances  Q and  R is  asymptotically  optimal,  i.e., 

lim  = lim  2. 

k -mo  Tc  k-KO  k 

where  2^  denotes  the  minimum,  linear  one-step  predictor  error 

covariance  when  a is  given. 

and  (ii)  The  residual  sequence  is  asymptotically  white,  i.e.,  C,=  0 for 

h 

all  h / 0. 


Proof:  (i)  It  can  be  observed  that  a ^ 2^  and  satisfy 


the 


same  RDE  with  initial  conditions  a . II  and  II  , respectively.  But 
kiS  ^ ~ M which  is  the  stabilizing  solution  of  the  ARE.  Since  the 

initial  values  of  a } 2^  and  are  different  from  one  another  by  only  a 
scale  factor,  it  can  then  be  shown  (see  Theorem  B given  in  Appendix  B) 
that  lim  2k  = a.M.  From  (2.20)  with  a - /3  , lim  = a.M  . The 

filter  is  therefore  asymptotically  optimal. 

(ii)  With  a = f} , it  is  clear  from  (2.21)  that  the  residual 
sequence  is  asymptotically  white.  ■ 


16 


The  above  corollary  indicates  the  insufficiency  of  the  whiteness 
test  on  the  residual  sequence,  suggested  Mehra  (1970)  to  determine 
whether  or  not  the  estimated  noise  covariances  are  adequate,  in  which 
the  filter  is  assumed  to  have  reached  steady- state  conditions.  It  is 
important  to  recognize  that  different  noise  covariances  can  lead  to  the 
same  optimal  steady- state  gain.  But  the  steady- state  covariances  of  the 
residual  sequences  corresponding  to  those  different  noise  covariances 
may  be  quite  different.  Alspach  (1972)  pointed  out  this  fact  by  his 
numerical  examples.  Alspach  also  suggested  an  additional  test  which  may 
be  performed  simultaneously  with  the  whiteness  test.  The  additional 
test  is  essentially  based  on  comparing  the  computed  steady-state 
covariance  and  the  estimated  steady- state  covariance  of  the  residual 
sequence,  i.e.,  HMH'  + R and  C , respectively.  As  we  can  see,  under 
the  conditions  given  in  Corollary  2.3,  the  steady-state  covariance  of 
the  residual  sequence  Cq  is  equal  to  a. (HMH'  + R)  which  is  different 
from  HMH'  + R , unless  a = 1. 

Iglehart  and  Leondes  (1974)  proposed  several  algorithms  for 
estimating  a . However,  Corollary  2.3  indicates  that  the  filter  can  be 
asymptotically  optimal  although  the  correct  value  of  a is  not  known. 
If  the  asymptotic  filter  performance  is  the  only  main  concern,  no 
estimation  of  the  unknown  a is  needed  under  the  situation  considered 
in  Corollary  2.3.  This  is  a good  example  to  show  that  better 
understanding  in  the  behavior  of  the  Kalman  filter  under  incorrect  noise 
covariances  can  help  one  avoid  unnecessary  computations. 

In  general , Q — Q + AQ  and  R = R°  + AR  where  AQ  and  AR  are 
unknown.  Based  on  Theorem  2.1,  bounds  on  the  limit  of  M^  can  be 
easily  obtained  if  AQ  and  AR  are  sign  definite  of  the  same  sign. 
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Suppose,  for  example,  that  and  converge  to  M and  M°, 

respectively,  then  M > M°  if  AQ  > 0 and  AR  > 0 . The  above 

requirement  on  sign  definiteness  of  AQ  and  AR  is,  however, 

restrictive.  Instead,  one  sometimes  has  information  about  the  range  of 

errors,  i.e.,  |Aq  | and  | Ar  | where  Aq.  . and  Ar.  . denote  the 

1J  1J  ij 

( 1 . j ) th  entries  of  AQ  and  AR,  respectively.  We  shall  next  obtain 
bounds  on  the  asymptotic  filter  performance  measured  by  the  trace  of  the 
limit  of  when  the  range  of  errors  is  known.  The  analysis  given  below 
is  similar  to  the  one  given  by  Toda  and  Patel  (1980).  Except,  we  are 
here  working  on  the  actual  one -step  predictor  error  covariance  instead 
of  the  actual  filter  error  covariance,  and  we  are  not  assuming  that  F is 
exponentially  stable,  as  required  in  Toda  and  Patel  (1980).  Let  us  now 
introduce  the  following  notations.  The  Kronecker  product  of  matrices  A 
and  B is  denoted  by  A®B  , and  vec(A)  denotes  the  column  vector 
obtained  by  concatenating  the  columns  of  matrix  A.  That  is 
vec(A)  = [ a|  ....  a^  ]'  where  denotes  the  i th  column  of  an 

m x m matrix  A.  The  Frobenius  norm  of  A is  denoted  by  ||a||^,  i.e., 

llAllf  :=  ^iSjlaijl  • With  the  above  notations,  we  shall  establish 

the  following  theorem. 

Theorem  2.4:  Subject  to  the  same  conditions  given  in  Theorem  2.3, 


tr(M) 

- P < 

tr(M  ) 

< 

tr(M)  + p 

(2.24) 

where  p 

: = 

| K ' PK  ||  f . | 

| AR  ||  ^ + 

l|P||f. 

<4-1 

"o7 

<_ 

(2.25) 

and  P 

is 

the  unique 

solution 

to  P 

= F'PF  + I 

(2.26) 

Proof : 

Since  the  conditions  in  Theorem  2.3  are  satisfied, 

and 

o o _ 

converge  to  M and  M , respectively,  and  F is  exponentially  stable. 
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But  M°  = FM°F'  + K(R  - AR)K'  + (Q  - AQ) 

FM°F'  + (KRK'  + Q)  - (KARK'  + AQ) 

Thus,  vec(M°)  = F®F  vec(M°)  + vec(KRK'  + Q)  - vec(KARK'  + AQ) 
where  we  have  used  the  identity  (Brewer,  1978)  vec(ABC)  = (C'®A)vec(B) . 

^et  T :=  (In2  - F®F)  , where  I 2 denotes  an  n2  x n2  identity 
matrix  and  n denotes  the  dimension  of  F.  Because  of  the  exponential 

stability  of  F,  all  eigenvalues  of  F®F  lie  inside  the  unit  circle. 

This  in  turn  implies  that  T is  invertible.  Hence, 

vec(M°)  = T-1vec(KRK'  + Q)  - T~ 1vec(KARK'  + AQ)  (2.27) 

Using  the  identity  tr(A)  = (vec(Im) ) 'vec(A)  where  A is  an  m x m 
matrix,  we  have  that 

tr(M°)  = (vec(In))'T  1vec(KRK'  + Q)  - (vec(In)) 'T_1vec(KARK'  + AQ)(2.28) 

By  performing  similar  algebraic  manipulations,  we  obtain 

tr(M)  = (vec(I  ))'T  1vec(KRK'  + Q)  and  (vec(P))'  = (vec(I  ))'T_:L 
n n 

Thus,  (2.28)  can  be  rewritten  as 

tr(M°)  - tr(M)  - (vec(P) ) 'vec(KARK'  + AQ)  (2.29) 

But,  (vec(P) ) 'vec(KARK' ) = (vec(K' PK) ) ' vec(AR) . Equation  (2.29) 

can  then  be  written  as 

tr(M°)  = tr(M)  - (vec(K' PK) ) ' vec(AR)  - (vec(P) ) ' vec(AQ)  (2.30) 

Observe  that  ||vec(A)||f  = ||a|| f . Thus, 

||  (vec(K'  PK) ) ' vec(AR)  + (vec(P)  ) ' vec(AQ)  ||f  < ||K'  PK||  f . ||ar||  f + ||p||  ||AQ||f 

It  is  immediate  from  (2.30)  and  the  above  inequality  that  (2.24) 


holds . 
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The  significance  of  the  above  theorem  is  that  the  bounds  of  the 
asymptotic  filter  performance  can  be  computed  only  by  knowledge  of  the 
Frobenius  norms  of  AQ  and  AR.  One  should,  however,  observe  that  the 
lower  bound  of  tr(M°)  given  in  the  above  theorem  is  useful  only  when 
tr(M)  > p since  tr(M°)  is  always  nonnegative.  It  should  also  be  noted 
that  tighter  bounds  on  tr(M  ) than  those  given  above,  if  desired,  may  be 
obtained  by  using  |Aq  | and  |Ar  | directly  instead  of  || AQ ||  and 
l|AR||f. 

Remark — 2 . 10 : If  F has  all  its  eigenvalues  inside  the  unit  circle 
and  Q = 0,  lim  ^ = M°  whose  trace  is  less  than  or  equal  to  ||P||f.  || AQ|| f 
where  P is  the  unique  solution  to  P = F'PF  + I.  This  is  clear  since 
K for  this  case  is  a zero  matrix. 

2 . 3 Divergence  Analysis 

So  far,  we  have  considered  cases  in  which  converges.  The  next 
subject  of  our  study  is  the  divergence  of  . In  particular,  we  shall 
show  that  incorrect  values  of  the  system  noise  covariance  can  cause  the 
filter  to  diverge,  i.e.,  the  actual  one-step  predictor  error  covariance 
M°  becomes  unbounded.  But  if  the  system  is  detectable,  filter 
divergence  cannot  be  caused  by  incorrect  values  of  the  measurement  noise 
covariance  alone,  provided  that  R > 0.  For  detectable  (F,H) , one  can 
observe  from  Theorem  2.3  and  Remark  2.5  that  if  (F,Q^)  is  stabilizable , 
then  incorrect  values  of  noise  covariances  cannot  cause  the  filter  to 
diverge.  It  is,  therefore,  necessary  that  F has  at  least  one 
unreachable  mode  associated  with  an  eigenvalue  lying  outside  or  on  the 
unit  circle  if  the  divergence  occurs. 

First,  let  us  prove  the  following  two  lemmas  needed  for 
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establishing  the  divergence  of 


Lemma  2.2:  Suppose  that  F has  an  eigenvalue  A with  corresponding 
left  eigenvector  z such  that  zMq  = zQ  = 0.  Then  zM^  = 0 V k > 0. 

Proof:  Here,  zM^  = AzMjJ  F'  - H'  (HM^H'  + R)— 1HM^F'  ]. 

It  then  follows  immediately  from  assumption  zM  =0  that 

o 

zM.  =0  V k > 0.  _ 


The  above  lemma  simply  states  that  if  the  mode  corresponding  to 
eigenvalue  A of  F is  assumed  to  be  initially  known  and  not  excited  by 
the  system  noise,  i.e.,  zM^  = zQ  = 0,  then  that  mode  is  considered  to  be 
exactly  known  for  all  time  instances,  i.e.,  z^  =0  V k > 0. 

Lemma — 2.3:  Let  M satisfy  the  ARE,  A be  a complex  number  with 
| A | =1,  and  z be  a,  possibly  complex  valued,  row  vector  of 
appropriate  dimension. 

Then  zF  = Az  if  and  only  if  zF  = Az  and  zQ  = 0 . 

In  such  a case,  necessarily  zK  = 0 . 


Proof 


where 


Sufficiency: 

zMz*  = 

z denotes 
From  zF  = 

Thus , zK 


Since  M satisfies  the  ARE, 

z [ FMF'  - FMH'(HMH'  + R)-1HMF'  + Q ] z*  (2.31) 

the  complex  conjugate  transpose  of  z. 

Az  and  zQ  = 0 with  | A | =1,  (2.31)  reduces  to 
zMH'(HMH'  + R)_1HMz*  - 0 . 

zFMH' (HMH'  +R)  = 0 . It  is  then  clear  that 


zF  = Az. 


Necessity:  Since  M satisfies  the  ARE, 


21 


zMz*  = z [ FMF'  + KRK'  + Q ] z*  (2.32) 

But  zF  = Az  with  | A | =1  , (2.32)  then  reduces  to 
zKRK'z*  + zQz*  = 0 . 

Hence,  zKRK'z  = zQz*  = 0 since  both  terms  are  nonnegative. 
This  then  implies  that  zK  = 0 since  R is  nonsingular,  and  zQ  = 0. 
Thus , zF  = Az . _ 


The  above  lemma  simply  states  that  if  F has  an  eigenvalue  A on  the 
unit  circle  with  associated  left  eigenvector  z in  the  left  null  space  of 
Q,  then  F has  A as  its  eigenvalue  with  the  same  associated  left 
eigenvector  z,  and  conversely.  Let  us  now  give  conditions  for  which 
diverges . 


Theorem 2Ji:  Suppose  that  F has  an  eigenvalue  A lying  on  or 

outside  the  unit  circle,  and  its  corresponding  left  eigenvector  z is 
such  that  zMo  = zQ  = 0.  If  ZQ°  ^ 0,  then  zM^z*  - » as  k ^ . 


Proof:  by  Lemma  2.2,  zM^  =0  ¥ k > 0.  It  is  then  immediate  that 


o 

ii 

N 

and  zF,  = Az 

k 

¥ k > 0. 

But 

Cl 

- fkC4  + 

v\  + Q°  • 

Thus , 

^+1Z* 

|A|2.zm£z* 

o * 

+ zQ  z 

(2.33) 

Since 

| A | > 1 and 

zQ°z*  > 0,  one 

can  then  conclude  from 

(2.33) 

that 


wO  * 

zM,  z -►oo  as 


Remark 2.11:  From  (2.33),  one  can  observe  that  zM^z*  will 

approaches  infinity  exponentially  fast  if  |A|  >1  and  linearly  fast  if 

O 

] . Hence , the  above 
theorem  indicates  that  incorrect  values  of  the  system  noise  covariance 


| A | = 1.  Notice  also  that 


ZC* 


= E[  | z 


*k|k 
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can  cause  the  variance  of  a certain  linear  combination  of  the  estimation 
error  to  become  unbounded. 

In  case  that  F has  an  unreachable  mode  on  the  unit  circle,  it  will 
be  seen  from  the  next  theorem  that  can  diverge  even  though  the 
unreachable  mode  is  not  assumed  to  be  initially  known,  i.e.,  zM  / 0. 

Theorem — 2^6.:  Subject  to  (i)  converges,  and  (ii)  F has  an 

eigenvalue  A on  the  unit  circle  with  corresponding  left  eigenvector  z 
such  that  zQ  = 0.  If  zQ°  / 0,  then  zM^z*  - « as  k - <*, 

Proof:  Suppose  that  lim  inf  zM^z*  = a for  some  finite 

nonnegative  number  a. 

From  zm£+1z*  - z[  F^F'  + KjR°K£  + Q°  ]z*. 

Thus,  lim  inf  zM°+1z*  > lim  inf  zF^F^z*  + lim  inf  zK^K^z* 

, ~0  -k 

+ ZQ  2 (2.34) 

Based  on  the  convergence  of  and  Lemma  2.3,  we  have  that 

viS  zFi,  “ * z and  lim  zK,  - 0 . 

k -MO  k k -**>  1c 

Hence,  (2.34)  reduces  to 

a > a + zQ°z* 

Since  zQ  z > 0 , we  then  reach  a contradiction. 

Therefore,  lim  inf  zM^z*  = 00  • This  in  turn  implies  that 

O -Jc 

zM^z  *♦  °°  as  k -*  « . m 

Remark  2.12:  Notice  that  conditions  (i)  and  (ii)  of  Theorem  2.6 

can  simultaneously  hold  without  assumption  zM  =0.  If  this  were  not 

o 

the  case,  Theorem  2.6  would  be  a specialized  result  of  Theorem  2.5. 
Consider  for  example: 
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-h 


1 


1 


F 


2 


• H = [ 1 0 ] , Q = 0,  and  R = 1 . 


-1 


1 


Clearly,  condition  (ii)  is  satisfied  in  this  case.  In  addition,  (F,H) 
is  observable  (and  hence  detectable) , and  M - 0 is  the  strong  solution 
of  the  ARE,  i.e.,  F has  no  eigenvalues  outside  the  unit  circle.  It  then 


V M > °.  In  particular,  if  M >0,  then  lim  M.  = 0 and  zM  4 0 

u o k-M0  k o 

Fitzgerald  (1971)  investigated  divergence  of  the  continuous -time 
Kalman  filter.  One  of  interesting  questions  addressed  by  Fitzgerald  is 
whether  or  not  incorrect  values  of  the  measurement  noise  covariance 
alone  can  cause  the  filter  to  diverge.  The  answer  to  this  question 
under  restrictive  conditions  was  also  given  in  Fitzgerald  (1971).  Here, 
we  shall  show  that  filter  divergence  will  not  occur  for  any  R,  which  has 
been  assumed  to  be  positive  definite  throughout  this  chapter,  if  (F,H) 
is  detectable  and  Q = Q . That  is  if  the  filter  is  designed  with  only 
errors  in  the  measurement  noise  covariance,  then  detectability  of  the 
system  will  guarantee  uniform  boundedness  of  the  actual  one-step 
predictor  error  covariance.  The  following  lemma,  whose  proof  can  be 
found  in  Mehra  (1976),  will  be  used  in  establishing  the  above  statement. 

Lemma  2.4:  Let  A and  B be  symmetric  matrices  of  the  same  dimension 
and  B > 0.  Then 


follows  from  De  Souza  et  al. 


(1986,  Theorem  4.2)  that  lim  M,  = 0 

k-*x>  lc 


tr(AB)  < A (A)  tr (B) 
max 


where  ^m^n(A)  and  ^max(A)  denote  the  minimum  and  maximum  eigenvalues 
of  A,  respectively. 


max 


The  following  theorem  can  now  be  established. 
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Theorem  2.7:  Let  (F,H)  be  detectable  and  Q = Q°.  Then  is 
uniformly  bounded  for  any  R > 0. 


Proof:  Based  on  (2.4)  and  (2.5),  and  can  be  expressed  as 

k - 1 

\ “ *(k,0)II®'  (k,0)  + .So  ^(k,i+l)KiRKp' (k,i+l) 


k-  1 


+ Z *(k,i+l)Q*'(k,i+l) 


(2.35) 


O k - i 

Mfc.  “ «(k,0)IIfl'' (k,0)  + ,So  ^(k,i+l)KiR°K^'  (k,i+l) 

k - 1 

+ i?0  ®(k,i+l)Q0*' (k,i+l)  (2.36) 

Because  of  the  detectability  of  (F,H),  is  uniformly  bounded. 
But  each  of  the  terms  on  the  RHS  of  (2.35)  is  nonnegative  definite. 
Consequently,  each  term  must  be  uniformly  bounded.  From  (2.36)  with 
Q°  = Q,  one  can  then  observe  that  is  uniformly  bounded  if  the  second 
term  on  the  the  RHS  of  (2.36)  is  uniformly  bounded.  From 
k * 1 k - 1 

tr(  iSQ  *(k,i+l)KiRKJ*' (k,i+l)  ) = tr(  R ^ Kj*' (k, i+l)*(k, i+1)^  ) 

- A 1ti(r)  ^k  — 0 (by  Lemma  2.4) 


k-  1 

where  £k  : = tr(  ^ K!*'  (k,  i+l)¥(k,  i+l)K  ). 

Since  the  second  term  on  the  RHS  of  (2.35)  is  uniformly  bounded, 

its  trace  must  also  be  uniformly  bounded.  In  addition,  A (R)  > 0 

min 

Hence,  £k  is  uniformly  bounded.  Let  us  denote  the  induced  Euclidean 
norm  of  matrix  A by  ||a||  , i.e.,  ||a||  = ( A (A*A))1/2.  Then, 

2 2 IIlcLX 

k-x  k-1 

II  iEo  *(k,i+l)KiR  K'*'(k,i+l)|a  = Amax(  iSo  *(k,i+l)K.R°K'*'(k,i+l)  ) 


k-l 

< tr(  , 2q  ¥(k,i+l)K.R0KJ*' (k,i+l)  ) 
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- Amax(R°)  (by  Lemma  2.4) 

The  second  term  on  the  RHS  of  (2.36)  is  therefore  uniformly 
bounded.  This  in  turn  implies  that  is  uniformly  bounded.  ■ 

2.4  Illustrative  Examples 

For  a purpose  of  demonstrating  the  results  given  Sections  2.2  and 
2.3,  we  consider  the  following  simple  scalar  system: 

Xk+1  “ a'xk  + wk  <2-37> 

yk  - xk  + vk  (2-38) 

where  a is  a real  number  and  the  covariances  of  x , w,  , and  v are 

o k k 

given  by  n > 0,  q >0,  and  r°  > 0,  respectively. 

Suppose  that  the  Kalman  filter  has  been  designed  with  no  system 
noise,  i.e.,  q = 0,  and  measurement  noise  covariance  r = 1.  The 
calculated  one -step  predictor  error  covariance  m^  and  the  actual  one- 
step  predictor  error  covariance  m£  satisfy  the  following  difference 
equations . 


\+l 

2 

3 • “k  / (“k  + -1) 

; m = 7r 
o 

(2.39) 

o 

“k+l 

, o , 2 o n 

■ hfc-k  + Wr  + q 

O O 
S 

(2.40) 

where  b,  : = 

k 

2 2 
a / (n^  + 1 )Z  . 

Solutions  to  the  above  difference  equations  for  k>0  can  be  written 
as 


(i)  For  | a | f 1, 

2k..  , . 2k 

= cTra  /(c  + 7r(a  - 1))  (2.41) 

\ - { c2tt  + C7r2(a2k—  l)r°  + [ (c  - *)2(1  - a-2k)/c 

+ 2tt(c  - ?r)k  + 7r2a2(a2k-  l)/c  ] q°  ) a2k/(c  + 7r(a2k-l))2  (2.42) 

where  c : - a^  — 1 . 
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(ii)  For  | a | = 1, 

i\  = */(*k  + 1)  (2.43) 

\ = (’r  + *2k*°  + (?r2 (k+1)  (2k+l)/6  + jr(k+l)  + 1)  kq°)/(7rk  + l)2  (2.44) 

It  is  clear  that  (F,H)  = (a,l)  is  observable  (and  therefore 
detectable)  for  all  values  of  a.  Let  us  now  consider  the  following 

three  possible  cases. 

lal  < 1 : Here,  it  is  clear  that  (F.Q^)  - (a,0)  is 

stabilizable . Thus,  converges  to  the  stabilizing  solution  of  the  ARE, 
which  is  zero  in  this  case.  It  then  follows  from  Theorem  2.3  that  n£ 


Figure  2.1:  Behaviors  of  m^  and  m£  for  a = 0.85,  q°=  0.5. 
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converges  to  -q°/c  which  is  the  solution  of  the  ALE.  From  (2.41)  and 

(2.42),  one  can  indeed  see  that  lim  m.  = 0 and  lim  m,°  = -a°/c 

k-*»  Tc  k-«»  k H ' 

for  all  n > 0.  It  should  be  noted  that  the  limit  of  m^,  in  this  case, 
does  not  depend  on  the  actual  measurement  noise  covariance  r°,  as 
pointed  out  in  Remark  2.7.  This  fact  can  also  be  observed  from  Fig.  2.1 
which  depicts  behaviors  of  and  m£  for  a = 0.85  with  n = 1 and  some 
values  of  q°  and  r°. 

^ase — 2.>  lal  > 1 : ln  this  case,  (F,!^)  — (a,0)  has  no  unreachable 
mode  on  the  unit  circle.  We  shall  consider  cases  n > 0 and  ir  = 0, 
separately. 

(i)  For  n > 0,  condition  (ii)  of  Remark  2.5  is  satisfied.  Thus, 

m^  converges  to  the  stabilizing  solution  of  the  ARE  which  is  equal  to  c 
in  this  case.  By  Theorem  2.3,  m£  in  turn  converges  to  (cr°+  a^q°/c) 
which  is  the  solution  of  the  ALE.  The  above  statements  can  also  be 
verifi-ed  by  taking  the  limits  of  m^  and  m^  given  respectively  in  (2.41) 
and  (2.42).  Behaviors  of  and  m£  for  a = 1.5  with  n = 1 and  some 

values  of  q and  r are  shown  in  Fig.  2.2.  From  Fig.  2.2,  one  can  also 

observe  that  although  (q  — q)  and  (r  — r)  are  not  of  the  same  sign,  as 

required  in  Theorem  2.1,  m£  can  be  less  or  greater  than  m^  for  all 
k > 0,  depending  on  the  values  of  q°  and  r°. 

(ii)  For  7r  — 0 , all  conditions  assumed  in  Theorem  2.5  are 

satisfied.  Thus,  m^  4 « as  k -*■  °°  . One  can,  in  fact,  observe  from 

(2.42)  that  m^  — (a  — l)q  /c  which  approaches  infinity  exponentially 

fast.  It  should  be  pointed  out  that  since  the  filter  is  designed  with 

no  system  noise  and  n — 0,  the  filter  will  ignore  the  measurements  y, 

yk 

completely.  The  estimate  j ^ i-n  this  case,  is  simply  the 

V 


expectation  of 
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Case_3,  |a|  = 1 : Here,  it  can  be  seem  from  (2.43)  that  lim  m,  = 0 

k-KO  1 

for  7T  > 0,  and  V k > 0 for  it  = 0.  Based  on  Theorem  2.6,  one 

then  can  conclude  that  m£  -»  « as  k -*■  ® . Using  (2.44),  one  can 
actually  show  that  m£  approaches  infinity  with  the  same  rate  as  k. 
Behaviors  of  m^  and  m^  for  |a|  =1  with  n = 1 and  some  values  of  q°  and 
r°  are  depicted  in  Fig.  2.3.  For  small  q°  (e.g.  q°  - 0.1),  it  can  be 
seen  from  the  figure  that  the  actual  one -step  predictor  error  covariance 
does  decrease  as  more  measurements  are  processed  for  a certain  period. 
But  after  that  period  expires,  the  actual  one-step  predictor  error 
covariance  increases  almost  linearly  as  k increases. 
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Figure  2.3:  Behaviors  of  m^  and  m£  for  |a|  - 1,  r°  - 2. 

2 . 5 Concluding  Remarks 

Behavior  of  the  discrete- time  Kalman  filter  under  incorrect  noise 
covariances  has  been  analyzed  in  this  chapter.  The  filter  performance 
is  quantified  by  the  actual  one-step  predictor  error  covariance. 
Through  this  quantity,  the  characteristic  of  the  filter  has  been 
studied.  Particular  emphasis  has  been  given  to  convergence  and 
divergence  properties  of  the  actual  one -step  predictor  error  covariance. 

Convergence  of  the  actual  one-step  predictor  error  covariance, 
which  is  often  used  as  a part  of  assumptions  in  some  of  literature,  has 
been  established.  In  addition,  it  has  been  shown  that  under  certain 
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conditions,  the  Kalman  filter  is  asymptotically  optimal  even  though  the 
noise  covariances  used  in  designing  the  filter  are  incorrect.  The 
residual  sequence  is  then  asymptotically  white  under  those  conditions. 
Bounds  on  the  asymptotic  filter  performance  have  been  derived  when  the 
range  of  errors  in  noise  covariances  is  known. 

Divergence  of  the  filter  has  been  investigated.  It  has  been 
shown,  in  particular,  that  if  the  state  matrix  has  an  unreachable  mode 
outside  or  on  the  unit  circle,  then  incorrect  values  of  the  system  noise 
covariance  may  cause  the  filter  to  diverge.  On  the  other  hand,  if  the 
filter  is  designed  with  only  errors  in  the  measurement  noise  covariance 
and  the  system  is  detectable,  then  filter  divergence  can  never  occur. 

The  contribution  of  the  results  presented  here  is  that  they  help 
one  to  understand  and  be  able  to  predict  certain  behavior  of  the  Kalman 
filter  when  incorrect  values  of  the  noise  covariances  are  used.  This 
is,  of  course,  important  since  the  exact  values  of  the  noise  covariances 
are  hardly  known  in  most  practical  cases.  Continuous- time  counterparts 
of  the  results  presented  here  will  be  discussed  in  the  next  chapter. 


CHAPTER  III 

BEHAVIOR  OF  THE  CONTINUOUS -TIME  KALMAN  FILTER 
UNDER  INCORRECT  NOISE  COVARIANCES 

In  this  chapter,  behavior  of  the  continuous- time  Kalman  filter 
under  incorrect  noise  covariances  is  analyzed.  The  filter  performance 
is  quantified  by  the  actual  state  error  covariance.  Similar  to  the 
previous  chapter,  Chapter  III  consists  of  two  main  parts,  convergence 
and  divergence  analyses.  In  the  former  part,  useful  results  concerning 
the  convergence  of  the  actual  state  error  covariance  are  given. 
Situations  in  which  the  actual  state  error  covariance  is  unbounded  are 
considered  in  the  latter  part.  In  particular,  we  generalize  the 
divergence  results  given  in  Fitzgerald  (1971).  The  results  presented  in 
this  chapter  are  based  on  those  given  in  Sangsuk-Iam  and  Bullock 
(1987b) . 

3 . 1 Preliminaries 

Consider  the  stochastic  dynamical  system  described  by 


x(t) 

F(t)  x(t)  + w(t) 

(3.1) 

y(t)  = 

H(t)  x(t)  + v(t) 

(3.2) 

where  x(t)  and  y(t)  denote  the  state  and  the  measurement,  respectively. 
It  is  assumed  that  system  noise  process  (w(t),  t>0)  and  measurement 
noise  process  (v(t) , t>0)  are  zero-mean  white  noise  processes  which  are 
uncorrelated  with  one  another  and  with  x(0) . The  autocovariances  of 
w(t),  and  v(t)  are  given  by 

COV(w(t) , w(s) ) = Q°(t)6(t— s)  , COV(v(t) , v(s) ) = R°(t)5(t— s)  (3.3) 
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where  S(t)  denotes  the  Dirac  delta  function.  For  convenience,  intensity 
matrices  Q (t)  and  R (t)  will  be  referred  to  as  system  and  measurement 
noise  covariances,  respectively. 

The  linear  minimum  variance  estimate  of  x(t)  given  (y(s) , 0<s<t) 
can  be  obtained  from  the  standard  Kalman  filter.  Let  us  now  consider 
the  case  that  the  filter  has  been  designed  with  system  and  measurement 
noise  covariances  Q(t)  > 0 and  R(t)  > 0,  respectively,  instead  of  the 
actual  noise  covariances  Q°(t)  and  R°(t).  The  estimate  x(t|t)  of  x(t) 
given  (y(s),  0<s<t)  obtained  from  the  filter  is  then  no  longer  the 
linear  minimum  variance  estimate.  Nevertheless,  x(t|t)  is  still  an 
unbiased  estimate  of  x(t) . 

Let  P(t)  denote  the  state  error  covariance  computed  from  the 
filter,  i.e. , P(t)  satisfies  the  following  Riccati  differential  equation 

(RDE) : 

P(t)  = F(t)P(t)  + P(t)F'(t)  - P(t)H' (t)R~ 1(t)H(t)P(t)  + Q(t)  (3.4) 

where  P(0)  = II  which  denotes  the  covariance  of  x(0) . 

Equation  (3.4)  can  also  be  written  in  another  form  as  follows. 

P(t)  - F(t)P(t)  + P(t)F'(t)  + K(t)R(t)K' (t)  + Q(t)  (3.5) 

where  K(t)  :=  P(t)H' (t)R-1(t)  and  F(t)  :=  F(t)  - K(t)H(t) . 

The  actual  state  error  covariance  P°(t)  is  defined  as  the  error 

A 

covariance  associated  with  x(t|t)  ,i.e., 

P (t)  :=  C0V(x( t j t) , x( 1 1 1) ) where  x(t|t)  :=  x(t)  - x(t|t) 

It  can  be  shown  (Nishimura,  1967b)  that 
P°(t)  = F(t)P°(t)  + P°(t)F'(t)  + K(t)R°(t)K' (t)  + Q°(t) 
where  P°(0)  = n. 


(3.6) 
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The  following  useful  result,  which  can  be  shown  by  subtracting 
(3.6)  from  (3.5),  is  due  to  Nishimura  (1967b). 

Theorem  3.1:  If  Q(t)  > (<)  Q°(t)  and  R(t)  > (<)  R°(t)  for  all 
t > 0,  then  P(t)  > (<)  P°(t)  for  all  t > 0. 

The  effects  of  pessimistic  and  optimistic  designs  can  be  seen  by 
the  above  theorem.  For  instance,  if  the  upper  bounds  of  noise 
covariances  are  known  instead  of  the  their  actual  values,  the  filter  may 
be  designed  by  setting  the  noise  covariances  at  their  upper  bound.  This 
is  known  as  a pessimistic  design.  By  the  above  theorem,  the  actual 
state  error  covariance  P°(t)  is  therefore  bounded  from  above  by  the 
computed  state  error  covariance  P(t). 

From  (3.6),  P°(t)  can  be  expressed  as 
P°(t)  = «(t,0)Po(0)*'(t,0) 

t 

+ / ®(t,s)[K(s)R°(s)K'(s)-K30(s)]*'(t,s)ds  (3.7) 

where  tf(t,s)  denotes  the  state  transition  matrix  associated  with  F(t) , 
i.e.  , 

3*r(t.s)  - 

gt  = F(t)  #(t,s)  ; #(s,s)  - I (3.8) 

One  can  observe  from  (3.7)  that  P°(t)  is  uniformly  bounded  (from 
above),  i.e.,  suj>  ||P°(t)||  < ® , if  F(t)  is  exponentially  stable  and 

[K(t)R  (t)K'(t)+Q  (t)]  is  uniformly  bounded.  This  leads  to  the 
following  theorem  which  is  a slight  generalization  of  a result  given  by 
Jazwinski  (1970,  p.254). 

Theorem  3.2:  Let  F(t) , H(t) , Q(t),  R_1(t) , Q°(t) , and  R°(t)  be 

uniformly  bounded.  If  P(t)  is  uniformly  bounded  and  F(t)  is 
exponentially  stable,  then  P°(t)  is  uniformly  bounded. 


34 


Remark  3.1:  For  uniformly  bounded  F(t) , H(t) , Q(t) , and  R 1(t),  a 

sufficient  condition  for  which  P(t)  is  uniformly  bounded  is  that 
-h 

(F(t),R  (t)H(t) ) is  uniformly  completely  observable  (Anderson,  1971a). 
If,  in  addition,  (F(t) ,Q  (t))  is  uniformly  completely  controllable,  then 
F(t)  is  exponentially  stable  (Kalman,  1963).  It  should  be  noted  that 
uniform  detectability  and  uniform  stabilizability  for  continuous -time 
systems  may  be  defined  in  a similar  way  to  the  definitions  given  by 
Anderson  and  Moore  (1981)  for  discrete- time  systems.  The  above  uniform 
observability  and  uniform  controllability  hypotheses  may  then  be 
weakened  to  uniform  detectability  and  uniform  stabilizability, 
respectively. 

Remark  3.2:  In  some  practical  situations,  P°(t)  can  be 
intolerably  large  even  though  it  is  uniformly  bounded.  This  phenomenon, 
which  is  known  as  apparent  divergence,  has  been  investigated  by 
Fitzgerald  (1971) . 

For  time- invariant  systems  with  stationary  noise  processes,  it  is 
well  known  that  under  certain  conditions  (Kwakernaak  and  Sivan,  1972; 
Kucera,  1973;  Callier  and  Willems,  1981),  P(t)  converges  to  P which  is 
a solution  to  the  following  algebraic  Riccati  equation  (ARE) : 

FP  + PF'  - PH'R_1HP  + Q = 0 (3.9) 

Let  K :=  PH ' R and  F :=  F — KH.  If  P is,  in  addition,  such 
that  F has  all  its  eigenvalues  with  negative  real  parts,  then  P is 
called  the  stabilizing  solution  of  the  ARE. 

Remark  3.3:  The  stabilizing  solution  P of  the  ARE  exists  and  is 
necessarily  unique  iff  (F,H)  is  detectable  and  (F,Q^)  has  no  uncontrol- 
lable mode  on  the  imaginary  axis  (Kucera,  1972a;  Molinari,  1973). 
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The  convergence  of  P (t) , however,  has  not  been  established  in  any 
literature,  It  is  often  used  as  a part  of  assumptions  in  some  of 
literature,  see  e.g.  Mehra  (1970)  and  Toda  and  Patel  (1978).  With 
constant  F,H,Q  , and  R , it  follows  from  (3.6)  that  if  K(t)  converges  to 
K and  P°(t)  converges  to  P°,  then  P°  must  satisfy  the  following 
algebraic  Lyapunov  equation  (ARE) : 


FP°  + P°F'  + KR°K'  + Q° 


(3.10) 


In  the  following,  we  shall  study  certain  properties  of  P°(t)  for 
linear  time- invariant  systems  with  stationary  noise  processes.  Two  main 
properties  of  P (t) , convergence  and  divergence,  will  be  investigated 
separately  in  Sections  3.2  and  3.3. 


3 . 2 Convergence  Analysis 

A sufficient  condition  for  which  P°(t)  converges  is  established  in 
this  section.  Special  cases  in  which  the  noise  covariances  are  known  up 
to  an  unknown  scale  factor  are  considered.  These  special  cases  are 
used,  in  particular,  to  demonstrate  the  fact  that  under  certain 
conditions,  the  Kalman  filter  is  asymptotically  optimal  despite 
incorrect  values  of  noise  covariances.  The  residual  process  is  then 
asymptotically  white  under  those  conditions.  Thus,  the  whiteness  test 
on  the  residual  process  suggested  by  Mehra  (1970)  is  not  sufficient  to 
determine  whether  or  not  the  estimated  noise  covariances  are  adequate. 
Bounds  on  the  asymptotic  filter  performance  measured  by  the  trace  of  the 
limit  of  P (t)  are  also  given  when  the  Frobenius  norms  of  AQ  and  AR  are 
known. 

To  prove  the  convergence  of  the  actual  state  error  covariance 
P (t),  we  need  the  following  lemma. 
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111111  a — Let  A(t)  be  a square  matrix  whose  entries  are 
differentiable  functions  of  t for  all  t £ [o  , «>)  . Suppose  that  A(t) 
satisfies  the  following  conditions: 

(i)  Jig  A(t)  = A whose  all  eigenvalues  have  negative  real  parts, 
and  (ii)  lim  A(t)  = 0. 

t,-*» 

Then  A(t)  is  exponentially  stable,  i.e.,  ||*(t,s)||  < ae  ^(t  S)  for 

all  t>s>0  for  some  positive  numbers  a and  ft  , where  $(t,s)  denotes  the 
state  transition  matrix  associated  with  A(t). 

Proof:  Using  continuity  and  convergence  properties  of  A(t) , one 
can  easily  show  that 

||A(t)  | < « (3.11) 

Since  eigenvalues  of  A(t)  are  continuous  functions  of  its  entries 

(Kato,  1976)  and  A(t)  converges  to  A whose  all  eigenvalues  have  negative 

real  parts,  there  exist  positive  numbers  t and  a such  that  the  real 

o 

parts  of  all  eigenvalues  of  A(t)  are  less  than  or  equal  to  -a  for  all 
t>tQ  , i.e., 

Re  A. [A(t)]  < —a  Vi  and  t>t  (3.12) 

1 o ' 

The  convergence  of  A(t)  to  a zero  matrix  implies  that  for  any 

e>  0,  there  exist  a positive  number  t^  such  that 

sup  ||A(t)  ||  < £ (3.13) 

~ l 

Based  on  the  exponential  stability  result  for  slowly  time -varying 
systems  proven  by  Desoer  (1969)  (see  Theorem  A.l  given  in  Appendix  A), 
(3.11)  to  (3.13)  imply  that  there  exist  positive  numbers  t^  , 7 , and  ft 
such  that 

ll*(t,s)||  < 7e-^(t-s)  v t>s>t2  (3.14) 
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Since  t ^ is  finite,  there  is  a positive  number  f such  that 

ll*(t,s)||  < f V t2>t>s>0  . (3.15) 

It  follows  directly  from  (3.14),  (3.15),  and  the  semigroup 

property  of  the  state  transition  matrix  that  there  exists  a positive 
number  a such  that 

|| <E» ( t , s ) ||  < ae"^t_s)  V t>s>0  . ■ 

Remark — 3.4:  It  should  be  noted  that  the  convergence  of  A(t)  to  a 
constant  matrix  A,  in  general,  does  not  imply  that  the  limit  of  A(t) 
exists  and  equals  zero.  For  example,  a(t)  = (sin  t3  )/t  -+  0 as  t -»  <*> 
But,  a(t)  -*  co  as  t -►  «.  However,  if  in  addition  the  second  derivative 
of  A(t)  exists  and  is  bounded  on  [o  ,<*>),  then  lim  A(t)  = 0 (Rudin,  1976). 

Remark  3.5:  If  the  stabilizing  solution  P of  the  ARE  exists  and 
P(t)  converges  to  P,  then  F(t)  :=  (F  - K(t)H)  satisfies  all  of  the 
assumptions  stated  in  Lemma  3.1.  Hence,  F(t)  is  exponentially  stable. 
That  is,  if  at  steady-state  conditions  the  Kalman  filter  is 
exponentially  stable  as  a time -invariant  filter,  then  it  must  be 
exponentially  stable  as  a time-varying  (but  asymptotically  time- 
invariant)  filter. 

A sufficient  condition  for  which  P (t)  converges  can  now  be 
established  as  follows. 

Theorem 3^_3:  Let  ( F , H)  be  detectable  and  (F , Q^)  have  no 

uncontrollable  mode  on  the  imaginary  axis.  Suppose  that  P(t)  converges 
to  P which  is  the  stabilizing  solution  of  the  ARE,  then  P°(t)  converges 
to  P°  which  is  the  unique  solution  of  the  ALE. 
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Proof:  Under  the  above  assumptions  on  F,  H,  and  Q,  the  stabilizing 
solution  P of  the  ARE  exists  and  is  unique.  Detectability  of  (F,H)  also 
implies  that  P(t)  is  uniformly  bounded.  As  a consequence  of  Lemma  3.1 
which  is  stated  in  Remark  3.5,  F(t)  is  exponentially  stable.  Hence, 
P (t)  is  uniformly  bounded  by  Theorem  3.2.  Because  of  the  exponential 
stability  of  F,  there  exists  a unique  solution  P°  to  the  ALE. 

Let  A(t)  :=  P°(t)  - P°.  Then, 


A(t)  = F(t)P°(t)  + P°(t)F'(t)  + K(t)R°(t)K' (t)  + Q° 

- FP°  - P°F'  - KRV  - Q°  (3.16) 

Since  K(t)  :=  P(t)H'R  is  uniformly  bounded  and  converges  to  K, 
K(t)  can  be  expressed  K(t)  = K + L(t)  where  L(t)  is  uniformly  bounded 
and  converges  to  a zero  matrix  as  t approaches  infinity.  Equation 
(3.16)  can  then  be  rewritten  as 


A(t)  = FA(t)  + A(t)F'  + 0(t)  (3.17) 

where  6( t)  :=  -L(t)HP°(t)  - P°(t)H'L'(t)  + L(t)R°K'(t)  + KR°L' (t)  . 

Observe  that  6 (t)  is  uniformly  bounded  and  lim  6 (t)  = 0.  From 


(3.17),  A(t)  can  be  written  as 


A(t)  = eFtA(0)  eF>t 


+ f eF(t  s)0(s)  eF'(t  s)ds  (3.18) 


Because  of  the  exponential  stability  of  F,  the  first  term  on  the 
right  hand  side  (RHS)  of  (3.18)  goes  to  zero  as  t approaches  infinity. 
Let  us  now  prove  that  the  second  term  on  RHS  of  (3.18)  also  goes  to 
zero  as  t approaches  infinity. 

II  J e ^ ^6(s)  e ^ ^ds||<  / ae  ||#  (s)  ||ds  for  some  a>0  and  /}> 0 


= / ae  S'>  ||  (9  (s)  ||ds  + f ae  S^||0(s)| 


ds 
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< rae 


; ^(t  °sS>?  ^(S)II  + f ae  ^(t  S>lk(3)| 


ds 


-0(t-s) 


But  Jig  f ae  "'ds  < oo  and  lim  0(t)  = 0.  It  then  follows 
o 

that  for  any  e > 0,  there  exist  positive  numbers  T and  T with  T>T  such 

o o 

that  for  all  t>T, 

To  -^(t-To)su?||^(s)||  < c/2  and  f ae~^(t~s)  || ^ ( s ) || ds  < c/2. 


t — 


Thus,  ||  / eF<t-s)>(s)  eF'(t-s>ds| 


for  all  t > T 


This  in  turn  implies  that  the  second  term  on  RHS  of  (3.18)  goes  to 
zero  as  t approaches  infinity.  Consequently,  lim  P°(t)  = P°  . ■ 


t -+co 


Remark  3.6:  In  case  that  (F,H)  is  detectable  and  (F.Q^)  has  no 

uncontrollable  mode  on  the  imaginary  axis,  P(t)  converges  to  the  unique 
stabilizing  solution  P of  the  ARE  if  one  of  the  following  conditions 
holds : 

h 

(i)  (F.Q  ) is  stabilizable  and  P(0)  > 0 (Kwakernaak  and  Sivan, 
1972) . 

(ii)  P(0)  > P (Pachter  and  Bullock,  1977;  Poubelle  et  al.,  1986). 

(iii)  P(0)  > 0 (see  Theorem  C.l  given  in  Appendix  C) . 


Remark — 3_^7:  From  the  proof  of  Theorem  3.3,  it  is  clear  that  the 
theorem  still  holds  even  though  P(0)  / n,  i.e.,  an  incorrect  initial 
state  error  covariance  is  used  in  computing  P(t) . 

Corollary — 3_J.:  Suppose  that  F has  all  eigenvalues  with  negative 
real  parts.  Then  P°(t)  converges  to  P°  which  is  the  unique  solution  of 
the  ALE . 


Proof:  Here,  it  is  obvious  that  (F,H)  is  detectable  and  (F,Q^)  is 
stabilizable.  Consequently,  P(t)  converges  to  the  stabilizing  solution 
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of  the  ARE.  It  then  follows  from  Theorem  3.3  that  P°(t)  converges  to 
the  unique  solution  of  the  ALE.  H 

Remark  3.8:  If  the  assumption  of  Corollary  3.1  holds  and  the 

filter  is,  in  addition,  designed  by  assuming  there  is  no  system  noise, 
i.e.,  Q = 0,  then  the  computed  state  error  covariance  P(t)  converges  to 
a zero  matrix.  Hence,  the  actual  state  error  covariance  P°(t)  converges 
to  P which  satisfies  FP  + P°F'+  Q°  = 0 and  is  clearly  independent  of 
R° . 

Let  y(t|t)  :=  y ( t)  - H x( t | t)  (3.19) 

The  process  (y(t|t),  t>0)  will  be  referred  to  as  residual 

process.  Let  us  now  define 

C(t,h)  :=  C0V(y(t| t) ,y(t— h| t-h))  , t>h>0  (3.20) 

If  the  correct  values  of  noise  covariances  were  used  in  designing 
the  filter,  (y(t|t),  t>0}  would  be  a zero -mean  white  noise  process  with 
C(t,h)  = R 5(h)  (Kailath,  1968).  Because  of  the  incorrect  values  of 
noise  covariances,  the  residual  process  is  not  white.  But,  the  process 
still  has  mean  zero. 

Equation  (3.19)  can  also  be  written  as 

y(t|t)  = H x(t|t)  + v(t)  (3.21) 

It  is  straightforward  to  show  that  the  estimation  error  x(t|t) 
satisfies  the  following  stochastic  differential  equation. 

x(t|t)  - F(t)  x(t|t)  - K(t)  v(t)  + w(t)  (3.22) 

Consequently,  x(t|t)  can  be  expressed  in  terms  of  stochastic 
integrals  as 

t t 

= tf(t,0)x(0|0)  - f 'f(t,s)K(s)v(s)ds  + f tf(t,s)w(s)ds  (3.23) 

0 o 


x(t| t) 
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Using  (3.21)  and  (3.23),  one  can  show  that 
C(t.h)  - 


HP°(t)H'  - [HK(t)R°  + R°K' (t)H' ]/2  + R°6(0)  ; h = 0 


H *(t,t-h)[P°(t-h)H'  - K(t-h)R°]  + R°6(h)  ; t>h>0 


(3.24) 


Remark  3.9:  Observe  that  if  R = R°  and  Q = Q°,  then  K(t-h)R°  is 
equal  to  P (t— h)H'  for  t>h>0.  In  such  a case,  one  can  observe  from 
(3.24)  that  C(t,h)  = R 5(h).  The  process  (y(t|t),  t>0}  is  therefore 
white,  as  pointed  out  earlier,  and  it  is  commonly  referred  to  as 
innovations  process. 


Based  on  Theorem  3.3  and  the  above  equation,  the  limit  of 
autocovariances  C(t,h)  of  the  residual  process  can  be  expressed  as 
follows . 


Corollary — 3_. ,2 : Subject  to  the  same  conditions  given  in  Theorem 


3.3, 


C,  :=  lim  C(t,h)  = - 

h t — «o  ' ’ ' 


HP°H'  - (HKR°  + R°K'H')/2  + R°5(0);  h = 0 


. H eFh(P°H'  - KR°)  + R°5 (h) 


h > 0 


(3.25) 


It  follows  from  (3.10)  and  (3.25)  that  the  limiting 
autocovariances  of  the  residual  process  have  their  entries  linearly 
related  to  the  entries  of  Q and  R . This  useful  fact  was  applied  by 
Mehra  (1970)  to  identify  the  actual  noise  covariances  Q°  and  R°  based  on 


the  estimates  of  C.  . 

h 

Remark — 3 . 10 : The  result  given  in  Corollary  3.2  can  be  further 

simplified  in  case  that  F has  all  its  eigenvalues  with  negative  real 
parts,  and  Q = 0.  for  such  a case,  the  limit  of  C(t,h)  always  exists 
and  is  given  by 

Ch  - H e^pV  + R°5  (h)  ; h > 0 

where  FP°  + P°F'  + Q°  = 0 . 
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Certain  structures  of  the  actual  noise  covariances  may  be  known  in 
some  practical  cases  (Burg  et  al.,  1982;  Iglehart  and  Leondes,  1974). 
In  particular,  the  noise  covariances  may  be  known  up  to  an  unknown  scale 
factor.  That  is,  the  actual  noise  covariances  are  of  the  form  Q°  = a.Q 
and  R = /J.R  where  a and  /3  are  unknown  positive  scalars.  The 
following  result  can  be  obtained  for  this  particular  case. 

Proposition — 3_J.:  Subject  to  the  same  conditions  given  in  Theorem 
3.3,  and  Q = a.Q,  R°  = ^.R  for  some  positive  scalars  a and  p,  then 

(i)  P°  = p.R  + (a  - p) .S 

(ii)  Ch  = (a  - p).H  e^  Sq  H'  + p .RS (h)  ;h>0 

where  S is  the  unique  solution  of  FS  +SF'+0  - 0 

q q q 

Proof:  (i)  Under  the  conditions  given  in  Theorem  3.3, 

P (t)  converge  respectively  to  P and  P , and  F is  exponentially  stable. 
It  then  follows  from  (3.9),  (3.10),  and  the  exponential  stability  of  F 
that 


00  — 


P 

= 

f eFt(KRK'  + Q)  eF,tdt 
0 

(3.29) 

P° 

= 

oo  — 

f eFt(p.KRK'  + a.Q)  e^dt 
0 

(3.30) 

Hence , 

P° 

= 

00  — — 

P.P  + (a  — p) . f eFt  Q eF  tdt 
0 

Because 

F 

is  exponentially  stable,  the  solution  of 

(3.28)  is 

unique 

and  can 

be 

written  as 

S 

q 

= 

r°°  Ft  F't, 

J e Q e dt 

n 

Consequently,  P°  = p.P  + (a  - P) .S 


(3.26) 

(3.27) 

(3.28) 
P(t)  and 
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(ii)  Here,  KR°  = ;9.PH'  and  P°  = /S.P  + (a  - /}).S  . Thus, 


P°H'  - KR°  = (a  - /S).S^H' 


(3.31) 

It  follows  immediately  from  (3.25)  and  (3.31)  that  (3.27)  holds.  ■ 


Let  use  now  consider  a special  case  in  which  a = /?  . Some 
interesting  behavior  of  the  Kalman  filter  is  revealed  for  this  special 
case.  In  particular,  it  will  be  seen  that  the  Kalman  filter  can  be 
asymptotically  optimal  despite  incorrect  values  of  a . Thus,  as  far  as 
the  asymptotic  filter  performance  is  concerned,  the  exact  value  of  a is 
not  needed. 

Corollary  3.3:  Subject  to  the  same  conditions  given  in  Proposition 
3 . 1 and  a = /3  , then 

A 

(i)  The  estimate  x(t|t)  obtained  from  the  Kalman  filter  using 
noise  covariances  Q and  R is  asymptotically  optimal,  i.e., 


lim  P (t) 

t->®  ' 


lim  £(t) 

t-»oo  v ' 


where  2(t)  denotes  the  minimum  state  error  covariance  for  linear 
filters  when  a is  given. 

and  (ii)  The  residual  process  is  asymptotically  white,  i.e.,  C^=  R°5(h). 
Proof:  (i)  We  first  observe  that 

S(t)  = F£(t)  + £(t)F'  - £(t)H' (a.R)-1HE(t)  + a.Q  ; £(0)  = n (3.32) 


-1 


Thus,  a 1.S(t)  and  P(t)  satisfy  the  same  RDE  with  initial  values 
a .n  and  II  , respectively.  But  P(t)  converges  to  P which  is  the 
stabilizing  solution  of  ARE.  It  can  be  shown  (see  Theorem  C.2  given  in 
Appendix  C)  that  a 1.2(t)  also  converges  to  P since  its  initial  value 
is  different  from  the  initial  value  of  P(t)  by  only  a scale  factor. 
Hence,  lim  £(t)  = a.P  . From  (3.26)  with  a = p,  lim  P°(t)  = a.P.  Thus, 

t.-'w  ' t 
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the  filter  is  asymptotically  optimal. 

(ii)  It  follows  immediately  from  (3.27)  that  the  residual  process 
is  asymptotically  white  for  a = p . m 

In  Mehra  (1970),  a whiteness  test  on  the  residual  process  was 
suggested  to  decide  whether  or  not  the  estimated  noise  covariances  were 
adequate,  in  which  the  filter  was  assumed  to  have  reached  steady- state 
conditions.  It  is  clear  form  Corollary  3.3  that  the  whiteness  test  is 
insufficient  for  the  purpose  described  above.  The  estimated  noise 
covariances  may  be  quite  different  from  the  actual  noise  covariances 
although  the  whiteness  test  is  passed. 

It  is  important  to  observe  that  different  noise  covariances  can 
lead  to  the  same  optimal  steady-state  gain.  But,  the  steady-state 
autocovariances  of  the  residual  processes  corresponding  to  different 
noise  covariances  may  be  quite  different.  Corollary  3.3  can  be  used  as 
an  example  demonstrating  the  above  fact. 

In  general,  Q = Q + AQ  and  R = R°  + AR  where  AQ  and  AR  are  not 
completely  known.  In  some  cases,  one  may  have  information  about  certain 
norms  of  AQ  and  AR.  Based  on  this  information,  certain  structures  of 
the  limit  of  P (t)  can  be  concluded.  The  case  in  which  the  Frobenius 
norms  of  AQ  and  AR  are  known  has  been  investigated  by  Toda  and  Patel 
(1978).  In  particular,  Toda  and  Patel  (1978)  have  derived  the  lower  and 
upper  bounds  on  the  limit  of  tr(P  (t))  based  on  the  information  about 
II AQ || f and  ||ar||^,  in  which  F is  assumed  to  be  exponentially  stable.  It 
is,  however,  clear  from  Theorem  3.3  that  the  limit  of  P (t)  may  exist 
even  though  F is  not  exponentially  stable.  Thus,  the  result  proven  by 
Toda  and  Patel  (1978)  can  be  generalized  as  follows. 
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Theorem  3.4:  Subject  to  the  same  conditions  given  in  Theorem  3.3, 


tr(P)  - 

p < tr(P°) 

< tr(P)  + p 

(3.33) 

where 

P :=  | K ' MK  ||  f . 

IMIf  + ||M||f. 

Nllf 

(3.34) 

and 

M is  the  unique 

solution  to  F1 

’M  + MF  = I 

(3.35) 

Proof:  Under  the  conditions  given  in  Theorem  3.3,  P(t)  and  P°(t) 
converge,  respectively,  to  P and  P°,  and  F is  exponentially  stable. 

From  FP°  + P°F'  + K(R  - AR)K'  + (Q  - AQ)  - 0. 

Thus,  FP°  + P°F'  = — (KRK'  + Q)  + (KARK'  + AQ)  (3.36) 

Let  T :=  (In®F  + FSI^)  , i.e.,  the  Kronecker  sum  of  F with  itself. 
Then,  (3.36)  can  be  rewritten  as 

T vec(P°)  - — vec(KRK'  + Q)  + vec (KARK'  + AQ)  (3.37) 

Since  F is  exponentially  stable,  T has  all  its  eigenvalues  with 
negative  real  parts.  This  in  turn  implies  that  T is  invertible.  Hence, 

vec(P°)  = -T  1vec(KRK'  + Q)  + T^vec  (KARK'  + AQ)  (3.38) 

By  performing  similar  algebraic  manipulations  to  those  given  in 
the  proof  of  Theorem  2.4,  one  can  then  show  that 

tr(P°)  = tr(P)  + (vec(K'MK) ) ' vec(AR)  + (vec(M) ) ' vec(AQ)  (3.39) 

Using  (3.39)  and  identity  ||vec(A)||^  = ||a||^  , one  can  show  that 
(3.33)  holds.  m 

Remark  3.11:  It  should  be  noted  that  the  lower  bound  of  tr(P°) 

given  in  (3.33)  is  useful  only  when  tr(P)  > p since  tr(P°)  is  always 
nonnegative . 


Remark — 3.12:  In  case  that  F has  all  eigenvalues  with  negative  real 
parts  and  Q - 0,  P°(t)  converges  to  P°,  the  unique  solution  of  the  ALE, 
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whose  trace  satisfies  tr(P°)  < ||M||f.  ||AQ||f  where  M is  the  unique 
solution  to  F'M  + MF  - I . This  is  due  to  the  fact  that  K is  a zero 
matrix  in  this  particular  case. 

3.4  Divergence  Analysis 

In  this  section,  we  investigate  situations  in  which  the  filter 
diverges,  i.e.,  the  actual  state  error  covariance  P°(t)  is  unbounded. 
In  particular,  we  are  interested  in  filter  divergence  caused  by 
incorrect  values  of  noise  covariances.  For  detectable  (F,H),  it  is 
clear  from  Theorem  3.3  and  Remark  3.6  that  incorrect  values  of  noise 


stabilizable . Thus,  it  is  necessary  that  F has  at  least  one 
uncontrollable  mode  associated  with  an  eigenvalue  whose  real  part  is 
nonnegative  if  the  divergence  occurs.  A special  case  in  which  F has 
uncontrollable  modes  associated  with  zero  eigenvalues  has  been 
previously  investigated  by  Fitzgerald  (1971). 

Let  us  now  prove  the  following  two  lemmas  which  are  useful  in 
establishing  conditions  for  which  P°(t)  diverges. 

Lemma  3.2:  Let  z be  a row  vector  with  appropriate  dimension. 

If  zP(O)  - zQ  - 0,  then  zP(t)  =0  V t 6 (o,®). 

Proof:  Since  P(t)  satisfies  the  RDE, 


covariances  cannot  cause  the  filter  to  diverge  if  (F.Q^)  is 


zP(0)z*  = z[  FP(0)  + P(0)F'  - P(0)H'R  1HP(0)  + Q ]z 


-1, 


* 


where  z denotes  the  complex  conjugate  transpose  of  z. 


It  is  then  clear  that  zP(0)z*  - 0 . This  in  turn  implies  that 


zP(t)  = 0 for  all  t e [o ,«)  since  zP(0)z*  = 0. 


The  above  lemma  simply  indicates  that  if  the  filter  is  designed  by 
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assuming  that  a certain  linear  combination  of  the  states  is  initially 
known  and  is  not  excited  by  the  system  noise,  i.e.,zP(0)  = zQ  = 0,  then 
that  linear  combination  of  the  states  is  considered  to  be  exactly  known 
for  all  time  instances,  i.e.,  zP(t)  = 0 for  all  t e [o  ,«) 

Lemma — 3.3:  Let  P satisfy  the  ARE,  A be  a purely  imaginary  number, 
and  z be  a,  possibly  complex  valued,  row  vector  of  appropriate 
dimension. 

Then  zF  = Az  if  and  only  if  zF  = Az  and  zQ  = 0 . 

In  such  a case,  necessarily  zK  = 0 . 

Proof 

Sufficiency:  Since  P satisfies  the  ARE, 

z [ FP  + PF'  - PH'R_1HP  + Q ]z*  = 0 (3.40) 

But  zF  = Az  and  zQ  = 0 with  A = jw  for  some  real  number  u>. 
Equation  (3.40)  then  becomes 

zPH'R  ^HPz*  = 0 (3.41) 

Hence,  zK  = zPH'R  = 0 . Consequently,  zF  = Az. 

Necessity:  Since  P satisfies  the  ARE, 

z [ FP  + PF'  + KRK'  + Q ]z*  = 0 (3.42) 

Since  zF  = Az  with  A = jw,  (3.42)  reduces  to 

"Jc  "jc 

zKRK'z  + zQz  = 0 (3.43) 

* * 

Thus  zKRK'z  = zQz  = 0 because  both  terms  are  nonnegative. 

Hence,  zQ  = 0 , and  zK  = 0 since  R > 0 . Thus,  zF  = Az.  ■ 

Remark 3,13:  It  should  be  noted  that  the  proof  of  the  above 

theorem  is  similar  to  that  of  Theorem  1 given  in  Fitzgerald  (1971), 
except  here  we  are  not  assuming  that  A = 0. 
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In  words,  Lemma  3.3  indicates  that  if  F has  an  eigenvalue  A on  the 
imaginary  axis  with  associated  left  eigenvector  z in  the  left  null  space 
Q>  then  F has  A as  its  eigenvalue  with  the  same  associated  left 
eigenvector  z,  and  conversely. 

The  conditions  for  which  P (t)  diverges  can  now  be  established  as 
follows . 

Theorem — 3_.  5 : Suppose  that  F has  an  eigenvalue  A with  nonnegative 
real  part,  and  its  associated  left  eigenvector  z is  such  that 
zP(O)  = zQ  = 0.  If  zQ°  ^ 0 , then  zP°(t)z*  -*•  « as  t -►  » 

Proof:  It  follows  from  Lemma  3.2  that  zP(t)  =0  , ¥ t e [o,«>). 
Consequently,  zK(t)  = zP(t)H'R_1  = 0 , V te  [o,®). 

But  zP°(t)z*  = z[  F(t)P°(t)  + P°(t)F'(t)  + K(t)R°K'(t)  + Q°  ] z* 

Thus,  zP°(t)z*  = 2 Re  A.zP°(t)z*  + zQ°z*  (3.44) 

Since  Re  A > 0 and  zQ  z > 0 , it  is  clear  form  (3.44)  that 

o * 

zP  ( t ) Z ->  00  as  t — ► 00  . g 

Observe  that  zP°(t)z*  = E[  | zx( 1 1 t) ) | 2 ] . The  above  theorem 

therefore  indicates  that  the  variance  of  a certain  linear  combination  of 

the  actual  estimation  error  may  become  unbounded  if  incorrect  values  of 

the  system  noise  covariance  are  used.  It  can  be  seen  from  (3.44)  that 

o * 

zP  (t)z  will  approaches  infinity  exponentially  fast  if  Re  A > 0 and 
linearly  fast  if  Re  A = 0 as  t approaches  infinity. 

One  of  the  conditions  assumed  in  Theorem  3.5  is  that  zP(0)  = 0. 
It  will  be  seen  from  the  next  theorem  that  the  divergence  of  P°(t)  can 
occur  even  though  zP(0)  / 0. 
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Theorem  3.6:  Subject  to  (i)  P(t)  converges,  and  (ii)  F has  an 

eigenvalue  A on  the  imaginary  axis  with  corresponding  left  eigenvector  z 
such  that  zQ  = 0.  If  ZQ°  / 0,  then  zP°(t)z*  -»  «.  as  t -►  « 

Proof:  Suppose  that  zP  (t)z  is  finite  as  t approaches  infinity. 

From  zP°(t)z*  = z[  F(t)P°(t)  + P°(t)F'(t)  + K(t)R°K' (t)  + Q°  ]z* 

Thus  lim  inf  zP°(t)z*  > lim  inf  zF(t)P°(t)z*  + lim  inf  zP°(t)F» (t)z* 

+ lim  inf  zK(t)R°K' (t)z*  + zQ°z*  (3.45) 


If  follows  from  the  convergence  of  P(t)  and  Lemma  3.3  that 
lim  zF ( t)  = Az  and  lim  zK(t)  - 0 . 

t ~HXI  t -+oo 


It  is  then  clear  that  the  sum  of  the  first  three  terms  on  RHS  of 
(3.45)  is  equal  to  zero.  Hence,  lim  inf  zP°(t)z*  > zQ°z*  > 0.  This 
then  contradicts  the  hypothesis  zP  (t)z  is  finite  as  t approaches 
infinity. 

Therefore,  zP°(t)z*  -+  «>  as  t -+  « . ■ 


Remark  3.14:  It  should  be  pointed  out  that  conditions  (i)  and  (ii) 
of  the  above  theorem  can  simultaneously  hold  without  zP(0)  = 0.  If  this 
were  not  true,  the  result  of  Theorem  3.6  would  be  included  by  that  of 
Theorem  3.5.  Consider  for  example: 


0 1 

-1  0 


. H = [ 1 0 ] , Q - 0,  and  R - 1 

Here,  it  is  clear  that  condition  (ii)  holds.  Furthermore,  (F,H)  is 
detectable,  and  P = 0 is  the  strong  solution  of  the  ARE,  i.e.,  F has  no 
eigenvalues  with  positive  real  parts.  It  then  follows  from  Theorem  3 
given  in  Poubelle  et  al.  (1986)  that  for  any  P(0)  > 0 , lim  P(t)  = 0 

t 

In  particular,  if  P(0)  > 0,  there  is  no  row  vector  z such  that 

zP(0)  = 0. 
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So  far,  we  have  shown  that  incorrect  values  of  the  system  noise 
covariance  can  cause  the  filter  to  diverge.  The  next  logical  question 
that  we  shall  investigate  is  whether  or  not  filter  divergence  can  be 
caused  by  the  incorrect  measurement  noise  covariance  if  the  correct 
system  noise  covariance  is  used  in  designing  the  filter.  This  question 
was  raised  by  Fitzgerald  (1971).  Fitzgerald  showed,  in  particular,  that 
under  certain  conditions,  which  were  rather  restrictive,  incorrect 
values  of  the  measurement  noise  covariance  could  not  cause  the  filter  to 
diverge  if  Q — Q . Here,  a result  which  is  more  general  than  that  given 
by  Fitzgerald  (1971)  will  be  established.  Specifically,  it  will  be 
shown  that  if  (F,H)  is  detectable  and  Q = Q°,  then  filter  divergence 
not  occur  for  any  R,  which  has  been  assumed  to  be  positive  definite 
throughout  this  chapter.  Let  us  how  prove  the  following  theorem. 

Theorem  3.7:  Let  (F,H)  be  detectable  and  Q = Q°.  Then  P°(t)  is 
uniformly  bounded  for  any  R > 0. 

Proof : From  (3.5)  and  (3.6),  P(t)  and  P°(t)  can  be  written  as 

t 

P(t)  = *(t,0)IW'(t,0)  + J <Kt,s)K(s)RK' (s)«'(t,s)ds 

o 

t 

+ f 'I  ( t , s ) ' ( t , s ) ds  (3.46) 

o 

t 

P (t)  = *(t,0)IMr'(t,0)  + f tf(t,s)K(s)R°K' (s)*' (t,s)ds 

0 

t 

+ / #(t,s)Q%'  (t,s)ds  (3.47) 

o 

Since  (F,H)  is  detectable,  P(t)  is  uniformly  bounded.  Each 
individual  term  on  the  RHS  of  (3.46)  is  nonnegative  definite.  Thus, 
each  term  must  be  uniformly  bounded.  It  then  follows  from  (3.47)  that 
for  Q = Q°,  P°(t)  is  uniformly  bounded  if  the  second  term  on  the  RHS  of 
(3.47)  is  uniformly  bounded. 
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f *(t,s)K(s)R°K'  (s)*'  (t,s)ds||  = A (/  tf(t , s)K(s)R°K'  (s)tf' (t , s)ds) 

02  max 

o 

t 

< tr(  f Hr(t,s)K(s)R°K'(s)r  (t,s)ds) 

0 

t 

“ tr(R°  f K' (s)tf' (t,s)tf(t,s)K(s)ds) 


“ ) €(t)  (by  Lemma  2.4) 

IllaX 


where  £(t)  :=  tr(  f K' (s)tf' (t, s)tf (t, s)K(s)ds  ) 


We  claim  that  £(t)  is  uniformly  bounded.  Hence,  the  second  term 
on  the  RHS  of  (3.47)  is  uniformly  bounded.  P (t)  is  therefore  uniformly 
bounded.  To  prove  the  claim,  we  observe  that 

t 

tr(  / tf(t,s)K(s)RK' (s)tf' (t,s)ds  ) > A . (R)  ^(t)  > 0 . 

0 min  ^ 

Since  the  second  term  on  the  RHS  of  (3.46)  is  uniformly  bounded, 

its  trace  must  also  be  uniformly  bounded.  Furthermore,  A (R)  > 0 

min 

Thus,  £(t)  is  uniformly  bounded.  This  completes  the  proof  of  the 
theorem.  _ 


3.4  Illustrative  Examples 

To  illustrate  the  results  developed  in  Sections  3.2  and  3.3,  we 


consider  the  following 

scalar  system: 

x(t) 

= a.x(t)  + 

w(t) 

(3.48) 

y(t) 

x(t)  + 

v(t) 

(3.49) 

where  a is  a real  number,  and  the  autocovariances  of  x , w(t) , and  v(t) 
are  given  by 

C0V(x(0) ,x(0) ) = 7r , COV (w(t) ,w(s))  = q°5 ( t— s ) , C0V(v(t) , v(s) ) = r°5(t-s) 
with  7T  > 0 , q°  > 0,  and  r°  > 0 . 
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Let  us  now  consider  the  case  that  the  Kalman  filter  has  been 
designed  with  no  system  noise,  i.e.,  q = 0,  and  measurement  noise 
covariance  r = 1.  In  this  case,  the  calculated  state  error  covariance 
p(t)  and  the  actual  state  error  covariance  p°(t)  satisfy  the  following 
differential  equations: 

P(t)  ” 2a.p(t)  - p2(t)  ; p (0)  - n (3.50) 

p°(t)  = 2(a  - p(t))p°(t)  + p2(t)r°  + q°  ; p°(0)  = n (3.51) 

Solving  the  above  differential  equations,  we  obtain  that  for  t>0: 

(i)  For  a / 0 

p(t)  = 2a7re2at/(2a  + 7r(e2at-  1))  (3.52) 

p°(t)  - {4a27r  + 2a?r2(e2at-  l)r°  + [(2a  - tt)2(1  - e_2at)/2a 

+ 2?r(2a  - ir)t  + 7r2(e2at-  l)/2a]  q°)  e2at/(2a  + 7r(e2at-l) ) 2 (3.53) 

(ii)  For  a = 0 

p(t)  = 7r/(  Trt  + 1)  (3.54) 

p°(t)  = [n  + n2t  r°  + [ (jrt  + l)3  - 1]  q°/3?r)  /(  nt  + l)2  (3.55) 

Notice  that  (F,H)  = (a,l)  is  detectable  for  all  values  of  a.  Let 

us  consider  the  following  three  possible  cases. 

Case 1,  a < 0 : As  mentioned  in  Remark  3.8,  p(t)  and  p°(t) 

converge  respectively  to  zero  and  p°  satisfying  2ap°  + q°  = 0 . Indeed, 

one  can  observe  from  (3.52)  and  (3.53)  that  lim  p(t)  = 0 and 

t -*x> 

P (t)  = ~q°/2a.  Behaviors  of  p(t)and  p°(t)  for  a = -0.15  with  some 
values  of  q and  r are  shown  in  Fig.  3.1.  It  can  also  be  observed 
from  Fig.  3.1  that  the  limit  of  p (t) , in  this  case,  is  independent  of 
the  actual  measurement  noise  covariance. 
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Figure  3.1:  Behaviors  of  p(t)  and  p°(t)  for  a 0.15,  q°  = 0.5. 

Case 2,  a > 0 : It  is  clear  in  this  case  that  (F,Q  ) = (a,0)  has 

no  uncontrollable  mode  on  the  imaginary  axis.  Let  us  consider  cases  n > 
0 and  n = 0 separately. 

(i)  For  n > 0,  p(t)  converges  to  the  stabilizing  solution  of  the 
ARE  (see  condition  (iii)  of  Remark  3.6).  Here,  the  ARE  is  2ap  - p2  = 0. 
The  stabilizing  solution  is  then  given  by  p = 2a.  By  Theorem  3.3,  p°(t) 
converges  to  p°  satisfying  -2ap°  + 4a2r°  + q°  = 0.  From  (3.52)  and 
(3.53),  one  can  observe  that  lira  p(t)  = 2a  and  lim  p°(t)  = 2ar°  + q°/2a 

t “►QO  t -XX)  'If 

which  indeed  agree  with  the  above  statements.  Behaviors  of  p(t)  and 
p°(t)  for  a = 0.15  with  some  values  of  q°  and  r°  are  depicted  in 
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Fig.  3.2.  Figure  3.2  also  shows  that  p°(t)  can  be  less  or  greater  than 
p(t)  for  all  t > 0 even  though  (q  — q)  and  (r°  — r)  are  not  of  the  same 
sign,  as  required  in  Theorem  3.1. 

(ii)  For  7r  = 0,  all  conditions  assumed  in  Theorem  3.5  hold. 
Therefore,  p°(t)  -*  «o  as  t -»  « . In  fact,  p°(t)  = (e2at-  l)q°/2a  as 
obtained  from  (3.53).  Hence,  p°(t)  approaches  infinity  with  an 
exponential  rate. 


Figure  3,2:  Behaviors  of  p(t)  and  p°(t)  for  a = 0.15. 

Case  3,  a = 0 : It  is  clear  from  (3.54)  that  lim  p(t)  = 0 for  it  > 0 

t —►oo  r 

and  p(t)  = 0 V t > 0 for  n = 0.  Thus,  all  conditions  assumed  in  Theorem 
3.6  are  satisfied.  This  in  turn  implies  that  p°(t)  -*•  ® as  t -+  <*>  . In 
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fact,  one  can  observe  from  (3.55)  that  p°(t)  approaches  infinity  with 
the  same  rate  as  t.  Behaviors  of  p(t)  and  p°(t)  with  n = 1,  r°  = 2 and 
several  values  of  q are  shown  in  Fig.  3.3.  It  can  be  seen  from  Fig. 
3.3  that  for  small  q , e.g.,  q = 0.1,  the  actual  state  error 
covariance  p (t)  does  decrease  as  more  measurements  are  processed  for  a 
short  period.  But  for  large  t,  p (t)  increases  almost  linearly  as  t 
increases . 


Figure  3,3:  Behaviors  of  p(t)  and  p°(t)  for  a = 0,  r°  = 2 . 

3 . 5 Concluding  Remarks 

In  this  chapter,  we  have  analyzed  behavior  of  the  continuous -time 
Kalman  filter  under  incorrect  noise  covariances.  The  actual  state  error 
covariance  is  used  as  a quantity  representing  the  filter  performance. 
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Based  on  this  quantity,  the  characteristic  of  the  filter  has  been 
examined.  In  particular,  two  important  properties  of  the  actual  state 
error  covariance,  convergence  and  divergence,  have  been  investigated. 

A sufficient  condition  for  convergence  of  the  actual  state  error 
covariance  has  been  established.  It  is  shown,  in  addition,  that  under 
certain  conditions  the  Kalman  filter  is  asymptotically  optimal  despite 
incorrect  values  of  noise  covariances.  Under  these  conditions,  the 
residual  process  is  then  asymptotically  white. 

Situations  in  which  the  filter  diverges  has  been  investigated.  In 
particular,  it  has  been  shown  that  incorrect  values  of  the  system  noise 
covariance  may  cause  the  filter  to  diverge  if  the  state  matrix  has 
uncontrollable  modes  associated  with  eigenvalues  whose  real  parts  are 
nonnegative.  But  if  the  system  is  detectable,  incorrect  values  of  the 
measurement  noise  covariance  alone  cannot  cause  the  filter  to  diverge. 

It  can  be  seen  that  the  discrete  - time  and  continuous -time  results 
presented  respectively  in  the  previous  chapter  and  this  chapter  are  very 
similar.  In  most  cases,  one  can  immediately  write  down  what  the 
corresponding  continuous  - time  results  would  be  if  one  knows  the 
discrete- time  results,  and  vice  versa. 


CHAPTER  IV 

DIRECT  ESTIMATION  OF  NOISE  COVARIANCES 


In  some  practical  cases,  the  noise  covariances  are  unknown.  For 
such  cases,  one  might  want  to  estimate  the  unknown  noise  covariances  and 
use  the  estimated  noise  covariances  in  designing  the  Kalman  filter. 
Numerous  techniques  for  estimating  noise  covariances  have  been  proposed, 
see  for  example  Mehra  (1970,  1972),  Weiss  (1970),  Belanger  (1974),  Myers 
and  Tapley  (1976),  Tabuchi  et  al.  (1978),  Ohnishi  (1980),  Lee  (1980), 
Friedland  (1982)  and  Lee  et  al . (1985).  For  linear  time- invariant 
systems  with  stationary  noise  processes,  most  of  commonly  used 
techniques,  e.g.,  measurement  correlation,  residual  sequence 
correlation,  and  covariance  matching  techniques,  have  at  least  one  of 
the  following  drawbacks.  First,  the  estimation  of  the  state  is  required 
as  an  intermediate  step.  Second,  the  measurements  or  the  residual 
sequence  must  reach  its  stationary  stage  before  one  starts  estimating 
the  unknown  noise  covariances. 

In  Lee  (1980),  an  interesting  and  rather  direct  technique  was 
suggested.  By  utilizing  the  minimal  polynomial  of  the  state  matrix,  Lee 
showed  that  a certain  linear  combination  of  the  measurements  was 
stationary  although  the  measurements  themselves  were  not  stationary. 
Based  on  the  sample  autocovariances  of  that  linear  combination  of  the 
measurements,  the  unknown  noise  covariances  can  be  estimated.  Clearly, 
this  technique  does  not  possess  the  undesirable  drawbacks  mentioned 
above.  It  is,  however,  unfortunate  that  Lee  merely  introduced  his  idea 
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and  did  not  give  any  analyses  on  his  technique.  Particularly, 
convergence  of  the  estimated  values  of  the  noise  covariances  to  their 
actual  values  was  not  discussed. 

In  this  chapter,  we  shall  discuss  and  analyze  the  direct  technique 
suggested  by  Lee  (1980)  which  will  be  referred  to  as  the  stationary 
preprocessed  measurement  correlation  technique.  The  technique  is  direct 
in  the  sense  that  the  unknown  noise  covariances  can  be  estimated  without 
requiring  the  estimate  of  the  state  and  the  stationarity  of  the 
measurements . 

4- . 1 Stationary  Preprocessed  Measurements 

Consider  the  following  discrete  stochastic  system 

Vi  - Fxk  + wk  f4-1* 

yk  - « ^ + vk  (4.2) 

where  and  v^  are  zero -mean  independent  noise  processes  with  unknown 
covariances  Q and  R,  respectively.  It  is  also  assumed  that  noise 
processes  and  v^  are  mutually  independent. 

Given  measurements  {yk>  0<k<N},  we  would  like  to  identify  the 
unknown  noise  covariances.  The  identification  can  be  achieved  by 
extracting  the  necessary  information  about  the  unknown  noise  covariances 
from  the  measurements.  Because  the  measurements  are,  in  general, 
nonstationary,  difficulty  in  extracting  the  necessary  information 
arises.  This  difficulty  can,  however,  be  circumvented  by  preprocessing 
the  measurements  to  remove  nonstationary  components.  We  shall  refer  to 
these  preprocessed  measurements  as  stationary  preprocessed  measurements. 
It  should  be  pointed  out  that  the  preprocessing  technique  mentioned 
above  is  commonly  used  in  time  series  analysis;  see  Anderson  (1971), 
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Box  and  Jenkins  (1976),  and  Aoki  (1987).  For  example,  polynomial  trends 
of  a given  time  series  can  be  removed  after  a certain  number  of 
differencing.  Taking  difference  of  the  logarithms  of  the  time  series, 
on  the  other  hand,  can  remove  exponential  growth  trends. 

Two  different  approaches  for  obtaining  stationary  preprocessed 
measurements  have  been  proposed  by  Ohnishi  (1980)  and  Lee  (1980).  The 
approach  proposed  by  Ohnishi  is,  however,  unnecessarily  complicated. 
Futhermore,  the  dimension  of  the  stationary  preprocessed  measurements 
obtained  from  Ohnishi 's  approach  is  equal  to  the  dimension  of  the  state 
which  is  usually  greater  than  the  dimension  of  the  original 
measurements.  Lee's  approach,  on  the  other  hand,  is  much  simpler.  By 
utilizing  the  minimal  polynomial  of  the  state  matrix  F,  Lee  showed  that 
a certain  linear  combination  of  the  measurements  was  stationary  even 
though  the  measurements  themselves  were  not  stationary.  That  linear 
combination  of  the  measurements  was  then  used  as  a stationary 
preprocessed  measurement.  Clearly,  the  dimension  of  the  stationary 
preprocessed  measurements  obtained  from  Lee's  approach  remains  the  same 
as  the  dimension  of  the  original  measurements. 

Let  us  now  show  how  the  stationary  preprocessed  measurements  can 
be  obtained  by  utilization  of  the  minimal  polynomial  of  F.  Let  the 
minimal  polynomial  of  F be  given  by 

f(d)  = dm  + a dm_1  +...+  a (4.3) 

1 m 

From  (4.1)  and  (4.2),  we  have  that 

y,  = H x.  + v. 

k— m k— m k— m 

^k-m+1 


HF  x.  + H w,  + v, 

Tc-m  k-m  k-m+1 
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rk— m+2 


= HF 


V, 


m 


HF  w,  + 
k— m 


H w. 


k— m+1 


v. 


k— m+2 


IU  . - 

f,  - HFV  + E HFJ  w.  . + v, 

k k— m j = 1 k— j k 


Define  zfc  yfc  * a:  ,H  4-  . . . + yk_m  , k > m 

Since  f(F)  = 0 , it  follows  that 


(4.4) 


Ji  (HAi  Vi + ai  vk-i>  + vk 


(4.5) 


i - 1 


where  A.  :=  Z a F 
i j = o j 


i-j-l 


, i =l,2,..,m  with  a :=  1, 

o 


(4.6) 


Notice  that  can  be  computed  recursively  as 

Ai+1  “ FAi  + a^I  for  i - 1,2,.., m— 1 with  A^  — I . 

It  is  apparent  from  (4.5)  that  z^  is  (wide-sense)  stationary. 

Thus,  stationary  preprocessed  measurement  z,  is  obtained. 

k 


4 . 2 Estimation  of  Noise  Covariances 

In  this  section,  we  describe  how  one  can  utilize  the  correlation 
of  stationary  preprocessed  measurements  z to  obtain  the  estimates  of 
the  unknown  noise  covariances.  It  is  clear  from  (4.5)  that  the  process 
Zk  not  only  stationary  but  also  m- dependent,  i.e.,  z.  and  z.  are 
independent  if  | i— j | > m (further  discussions  on  m-dependent  time  series 
can  be  found  in  Hoeffding  and  Robbins  (1948)  and  Fuller  (1976)).  Hence, 
z k has  zero  autocovariances  after  time  lag  m. 

Let  Ch  C0V(  zk>  zk-h  } 

H(A  Q A'  + A Q A'  + . . . + A Q A')H' 
l 1 Z / m m 

+(l+a^+.  . .+a^)R 
1 m 


Then 
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- H(A  Q A'  + A Q A!  + . . . + A Q A'  )H 


+ (a  + a a + . . . + a a . 
1 z 1 m m-1 


m m-1 

) R 


C. 

l 


H(Ai+iQ  A{  + Ai+2Q  A2  + 


+ A Q A'  . )H' 
m m-i 


+ (at  + a.+iai  + . . 


+ a a . ) R 
m m-i 


m 


a R 
m 


(4.8) 


From  the  above  expressions  of  C,  , one  can  see  that  the  entries  of 

h 

^h’  an<^  R are  linearly  related.  Hence,  a set  of  linear  equations 

which  may  be  used  to  solve  for  Q and  R can  be  obtained  if  C , C , . . , C 

o ’ m 

—1 

are  given.  In  particular,  R = a .C  provided  that  a is  not  zero,  or 

m m m 

equivalently  F is  invertible  (Herstein,  1975) . 


Given  measurements  {y^  , 0<k<N) , the  autocovariances  of  may 


be  estimated  by 


A 


N 

i?m  + h ZiZi-h  )/(N-^h+1)  > h “ 0.1 m (4.9) 


The  above  equation  can  be  written  in  a recursive  form  as 


VN>  - 


CN-m-h’) 

(N-m-h+1) 


Ch(N-l) 


+ (N-m-h+1) 


ZN  ZN-h 


N > m+h 


(4.10) 


where  ^(j)  denotes  the  estimate  of  based  on  measurements 

{yk  , 0<k<j } . 

A 

Using  the  stationary  property  of  z,  , one  can  easily  show  that  C 

K n 

is  an  unbiased  estimate  of  C^.  If  the  number  of  linearly  independent 
equations  relating  the  entries  of  C^,  Q and  R is  not  less  than  the 
number  of  unknown  entries  of  Q and  R,  the  estimates  of  Q and  R can  be 
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uniquely  determined  from  C,  . We  shall  refer  to  the  technique  for 
estimating  the  noise  covariances  described  above  as  the  stationary 
preprocessed  measurement  correlation  (SPMC)  technique.  The  convergence 
property  of  the  SPMC  technique  will  be  discussed  in  the  next  section. 

Let  us  now  make  an  observation  that  a linear  combination  of  the 
measurements  possessing  stationarity  can  be  defined  by  using  any 
polynomial  p(d)  which  is  satisfied  by  the  state  matrix,  i.e.,  p(F)  = 0. 
As  well  known,  it  is  in  general  difficult  to  obtain  the  minimal 
polynomial  of  a given  matrix.  Therefore,  it  is  worthwhile  to 
investigate  whether  there  is  benefit  in  defining  z^  based  on  the  minimal 
polynomial  of  F. 

Let  p(d)  be  a polynomial  which  is  satisfied  by  F and  n(p)  be  the 
number  of  independent  equations,  with  respect  to  entries  of  Q and  R, 
which  are  obtained  from  autocovariance  equations  (4.8)  of  stationary 
preprocessed  measurement  defined  as  in  (4.4)  by  using  polynomial  p(d) 
instead.  The  benefit  of  defining  stationary  preprocessed  measurements 
based  on  the  minimal  polynomial  of  F is  revealed  by  the  following 
proposition. 


Proposition  4.1:  Let  f(d)  be  the  minimal  polynomial  of  F.  Then 

for  any  polynomial  p(d)  which  is  satisfied  by  F,  n(f)  > n(p) . 

Proof:  Since  f(d)  is  the  minimal  polynomial  of  F,  p(d)  can  be 

factored  as 

p(d)  = g(d).f(d)  (4.11) 

S 

for  some  polynomial  g(d)  = b.  dS_1  . 

Let  z^  and  z^  be  linear  combinations  of  y^  defined  as  in  (4.4)  by 
using  polynomials  f(d)  and  p(d),  respectively. 

From  (4.11),  z^  can  be  expressed  in  terms  of  z^  as 
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S b.  z.  . 
i - 0 1 k-i 


(4.12) 


Let  Ch  and  Ch  be  defined  as  COV(zk  ,z ^)  and  COV ( zu , z,  v) , 


respectively. 

Then 


Cn  " COV(  S b4  z,  , ,Sb,  z,,  u 4 ) 


i = 0 i k-i  ’ j = o j k-h-j 


k k-h 


2 2 b.  b.  C,  . . 

i - 0 j = 0 1 J h+J-1 


(4.13) 


That  is  can  be  written  as  a linear  combination  of  C^.  This  in 
turn  implies  that  n(f)  > n(p)  . ■ 


4 . 3 Convergence  of  Noise  Covariance  Estimates 

In  this  section,  we  investigate  the  convergence  property  of  the 
noise  covariance  estimates  obtained  from  the  SPMC  technique.  Since  the 

A 

estimate  values  of  noise  covariances  are  linear  functions  of  C,  their 

h 

convergence  to  the  actual  values  depends  thoroughly  on  the  convergence 

A 

of  C,  to  C,  . As  mentioned  earlier,  for  non-zero  a the  measurement 
h h m 

noise  covariance  can  be  estimated  by 

A -»  A 

R - a"  . C (4.14) 

mm  ' ' 

A A 

But  if  Cm  does  not  converge  to  C^,  R may  not  be  even  a symmetric 
positive  semi-definite  matrix.  For  such  a case,  the  estimate  of  R is  not 

A 

useful.  Hence,  the  establishment  of  the  convergence  of  is  essential. 
Let  us  now  prove  he  following  theorem  which  can  be  directly 

A 

applied  to  establish  the  convergence  of  C^. 

Theorem  4.1:  Let  {u^,  k = 0,1,..)  be  a stationary  and  m-dependent 

sequence  of  zero-mean  random  vectors  with  sup  E[u^  j ] < 00  , where  u^  . 

denotes  the  j th  component  of  u^.  Then 


N - 1 
2 u.  u! 


N-h  i=h  i i— h n 


in  quadratic  mean 
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where 


Rh  :=  cov<Vui-h  > 


Proof:  Without  loss  of  generality,  we  shall  prove  the  theorem  by 
using  the  induced  Euclidean  norm. 


N-  1 


Here, 


2 u.  u'  - 


Where  v 


N-h  i=h  i i-h 
2 


Rh 


sup 

IvlUl 


1 N~1 

— s U.  U'  - r,  y 
N-h  i = h i i-h  hi 


v' V . 


N-l 


1 2 

To  prove  lim  E [ II  tt-t  E u.  u!  , — R,  II  ] = 0 , it  is  sufficient 

N-*»  11  N-h  i=h  l i-h  Ti  M J ’ 

show  that  for  any  v with  ||v||  = 1, 


n - 1 


lim  E [ II  ( “r  E u.  u'  - R.  )v|| 2 ] 
n-**>  11  v N-h  i=h  i i-h  h 11  J 


(4.15) 


N - 1 


Ei  IK  » 

i,  E u. 
-h  i=h  1 

ur  . - 

l-h 

\ )v|h 

1 

N-  1 

N-  1 

(N-h)2 

E 

i=h 

s 

j = h 

V'  E [ (u£ 

ui-h-  V'(uj 

-h-  Vi 

V 

1 

N-l 

i-h 

(N-h)2 

E 

i = h 

E 

k=i -N+l 

v'  E[(Ui 

ui-h- V'(ui-k 

Ui-k-h  " 

Vi 

V 

1 

0 

N-  1 + k 

(N-h)2 

E 

k = h - N + 1 

E 

i = h 

v'  E[ (u± 

ul-h-  V'<Vk 

Ui-k-h  “ 

V) 

V 

1 

N-  1-h 

N-l 

X 

(N-h)2 

E 
k = 1 

E 

i =k  + h 

v'  E[(u. 

ui-h- V'<Vk 

Ui-k-h  “ 

Vi 

V 

Now  observe  that 

u.  u! 
l l 

and 
- n 

u.  . u'  , are 

l-k  l-k-h 

idependent  for 

| k | > m+h.  Thus 


e t (u.  u: 

i i 

-h  ‘ 

- V'(ui. 

-k  Ui-k-h 

Vi 

= 0 

for  | k | > 

m+h  . 

Thus , for 

N > 

J3 

CNI 

e 

E!  IK  its 

N-l 

E 

i =h 

u.  u'  - 

l l-h 

- \ )v||2] 

1 

0 

N - 1 + k 

(N-h)2  k 

E 

= - ra  - 

s 

•h  i=h 

V'  Ef(Ui 

Ul-h- 

- V ' <ui 

-k  ui-k-h  " 

■VI  v 

1 

m + h 

N-l 

+ (N— h)z 

s 

k = 1 

E 

i=k  + h 

v'  E[(u. 

"i-h' 

- V'(ui 

-k  ui-k-h  “ 

■VI  v 
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N-  1 


— /kt  un2  (m+h+1)  S max  Jv'  E[(u.  u'  — R,  ) ' (u  u'  — R.  )1  vl 
(N-h)  7i=h  -m-h<k<o  i i-h  Ti'  y i-k  i-k-h  J 1 


N-  1 


+ (N-h)2  <m+h)  Et<ui  "i-h  - V'("i-k  "i-k-h  * V1  V| 

But  E[(Ul  u:.h-  V'<"i-k  "i-k-h  - VI  - Et"i-h  "i  "i-k  "i-k-hj  - K V 
Based  on  the  above  equation  and  sup  E[u^  ^ ] < ®>  , one  can  show  that  each 

entry  of  E [ (u±  u[_h~  1^) ' (u±_k  u£_fe_h  - R^) ] is  finite  for  all  h,  k,  and 
i.  It  is  then  immediate  that 


E[  «<  i “i  "i-h  - “h  HI2]  s (itS) 


for  some  finite  number  c. 


Thus,  equation  (4.15)  holds.  This  in  turn  implies  that 


n - 1 


N-h  i?h  ui  u[-h  \ in  quadratic  mean- 


It  is  clear  from  the  above  theorem  that  the  sample  autocovariances 

A 

Ci  will  converge  in  quadratic  mean  to  the  actual  autocovariances  C,  as  N 
11  h 

4 4 

increases  if  sup  E[w,  ] and  sup  E[v.  .]  are  finite.  Under  the  same 
k,J  KiJ  k.O  k , J 

conditions,  the  estimates  of  the  noise  covariances  will  also  converge  in 
quadratic  mean  to  the  actual  values  as  N increases  since  the  estimates 

A 

ate  linear  functions  of  C^.  It  should  also  be  noted  that  convergence  in 
quadratic  mean  implies  convergence  in  probability  (Chung,  1974;  Wong  and 
Hajek,  1985).  Thus  , under  the  above  conditions  on  the  fourth  moments 
of  the  noise  processes,  the  estimates  of  the  noise  covariances  obtained 
from  the  SPMC  technique  are  not  only  unbiased  but  also  consistent. 


4.4  Unique  Determinabilitv  of  Noise  Covariances 

Given  C^,  the  noise  covariances  Q and  R can  be  uniquely  determined 
if  the  number  of  unknown  entries  of  Q and  R is  less  than  or  equal  to 
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n(f)  . It  is,  however,  important  to  observe  that  the  measurement  noise 
covariance  R can  always  be  uniquely  determined,  provided  that  the  state 
matrix  F is  invertible.  It  should  also  be  pointed  out  that  when  F is 
invertible,  H does  not  have  to  be  known  if  one  wants  to  determine  only 
R.  In  case  that  F is  singular,  one  may  or  may  not  be  able  to  determine 
R uniquely.  The  following  example  demonstrates  the  preceding 
statement. 


Example  4.1:  Consider  the  system  described  by  (4.1)  and  (4.2) 


with 


H 


A,  = 


C 


'0 

1 

1 

0 

0 " 

= 

0 

1 

2 

Q 

= 

o1  • 

% 

0 

_0 

0 

1_ 

_0 

o2 

C 

*3- 

1 

0 

0 

r. 

0 

0 

0 

1 

R 

o1 

r2-l 

I3  + a 

1 d 

2 

+ 

a2  d 

+ a3 

where 

al  = 

-2, 

a2  = 

i, 

and  a3  = 

' 1 

0 

O' 

'-2  1 

l" 

1 

-1  l" 

0 

1 

0 

A 

9 = 

0 -1 

2 

A 

= 

0 

0 0 

_ 0 

0 

1 

Z 

0 0 

-1  _ 

■J 

_ 0 

0 0_ 

i(AxQA 

1 + 

a2QA'  + 

A3QA')H'  + 

(1  + 

4 + 

2 

a2  + 

2 

a3 

)R 

6qL  + 2q2  + 2q3  + 6^ 


-ch 


-q-: 


2q3  + 6r2  _j 


(4.16) 


C„  = 


H(A2QA^  + A3QA£)H'  + (a3  + a2a1  + a^R 

0 

-j  ± 

0 -q,  - J 


-4qi  ~ q2  + ^3  “ 4ri 


(4.17) 


C2  = H(A3QApH'  + (a2  + aia3)R 


ql  + rl 


2 -J 


(4.18) 
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Given  Cq,  C^,  and  C^,  it  can  be  observed  from  (4. 16)— (4. 18)  that 
only  , and  (q^  + r^)  can  be  uniquely  determined.  Thus  R and 
also  Q cannot  be  uniquely  determined  in  this  case.  On  the  other  hand, 
if  it  is  known  beforehand  that  the  first  component  of  the  state  is  not 
directly  excited  by  the  system  noise,  i.e.,  q^  = 0,  the  measurement 
noise  covariance  R can  be  uniquely  determined. 

In  case  that  the  number  of  unknown  entries  of  Q and  R is  greater 
than  n(f),  the  SPMC  technique  does  not  provide  unique  estimates  of  the 
noise  covariances.  For  such  a case,  one  might  want  to  estimate  the 
optimal  filter  steady-state  gain,  if  it  exists,  instead.  There  are 
several  techniques  available  for  estimating  the  optimal  filter  steady- 
state  gain,  see  Mehra  (1970),  Carew  and  Belanger  (1973),  Neethling  and 
Young  (1974),  Tse  and  Weinert  (1975),  and  Taj ima  (1978). 

4 . 5 Simulation  Results 

A simulation  study  of  the  SPMC  technique  is  given  in  this  section. 
The  system  used  in  the  simulation  study  is  described  by  (4.1)  and  (4.2) 
with 


1 

F 

_ 0 

i 1 

r- < r— 1 

H = 

[ 1 

II 

O' 

o 

_ql 

_0 

1 1 

OJ 

o c r 

and  and  v^  are 

Gaussian 

white 

noise 

sequences . 

The  minimal  polynomial  of  F is  given  by 


f (d)  = d2  - 2d  + 1 

Thus,  the  stationary  preprocessed  measurement  can  be  expressed  as 

\ - yk-  2=Vl  + ?k-2 

Here,  the  autocovariances  of  z,  are 

k 
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Cq  - 2q1  + q2  + 6r 

C1  = -*1  " 4r 

C2  “ r 

Given  measurements  of  {y  0<k<N} , the  estimates  of  C , C..  , and  CL 

K o 1 2 

can  be  obtained  according  to  (4.9).  The  estimates  of  , q^ , and  r can 
then  be  given  by 

A A 

r * C2 

A A A 

qx  “ - ( C1  + 4r  ) 

A AAA 

q2  = CQ  - 2qx  - 6r 


NOISE  VARIANCES 

% 

q2 

r 

ACTUAL  VALUE 

1.5 

1.0 

0.5 

RUN 

ESTIMATES 

NUMBER 

A 

q, 

A 

q2 

A 

r 

1 

1.503 

1.002 

0.489 

2 

1.790 

1.021 

0.437 

3 

1.328 

1.160 

0.496 

4 

1.555 

0.813 

0.  446 

5 

1.538 

1.001 

0.  506 

6 

1.677 

0.939 

0.461 

7 

1.489 

0.933 

0.534 

8 

1.420 

1.053 

0.  543 

9 

1.550 

1.094 

0.467 

10 

1.509 

0.982 

0.485 

Table  4,1:  Simulation  Results. 
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Figure  4,1:  The  Estimates  of  Noise  Variances. 


Based  on  Theorem  4.1  and  Guassian  properties  of  the  noise 
sequences,  the  estimates  of  noise  variances  are  expected  to  converge  to 
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their  actual  value.  Ten  simulation  runs  with  different  noise  sequences 
were  made.  The  number  of  measurements  used  for  each  run  was  4000.  The 
simulation  results  are  summarized  in  Table  4.1.  The  estimates  of  the 
noise  variances  q^,  q^ , and  r for  the  first  simulation  run  are  depicted 
in  Fig.  4.1  where  the  noise  variance  estimates  are  plotted  for  every  new 
100  data  points.  It  is  evident  from  Table  4.1  and  Fig.  4.1  that  the 

A A 

rate  of  convergence  of  q^  is  rather  slow  in  comparison  with  those  of  q^ 

A 

and  r. 

4 . 6 Concluding  Remarks 

A direct  technique  for  estimating  the  unknown  noise  covariances 
which  is  referred  to  as  the  SPMC  technique  has  been  discussed  and 
analyzed.  By  utilizing  the  minimal  polynomial  of  the  state  matrix,  we 
have  described  how  stationary  preprocessed  measurements  can  be  obtained. 
The  unknown  noise  covariances  can  then  be  directly  estimated  from  the 
sample  autocovariances  of  the  stationary  preprocessed  measurements 
without  requiring  the  estimate  of  the  state  and  the  stationarity  of  the 
measurements . 

It  has  been  observed  that  stationary  preprocessed  measurements  can 
be  obtained  by  using  any  polynomial  which  is  satisfied  by  the  state 
matrix.  But,  the  minimal  polynomial  gives  the  maximum  number  of 
independent  equations  which  can  be  used  to  determine  the  unknown  noise 
covariances . Under  certain  conditions  on  the  fourth  moments  of  the 
noise  processes,  it  has  been  shown  that  the  estimates  of  the  noise 
covariances  converge  in  quadratic  mean  to  their  actual  value.  Unique 
determinability  of  noise  covariances  has  also  been  discussed.  In 
particular,  we  have  observed  that  the  measurement  noise  covariance  can 


71 


always  be  uniquely  determined  if  the  state  matrix  is  nonsingular.  In 
case  that  the  state  matrix  is  singular,  one  may  or  may  not  be  able  to 
uniquely  determine  the  measurement  noise  covariance. 


CHAPTER  V 

DETECTION  OF  ABRUPT  CHANGES  IN  NOISE  COVARIANCES 


Unexpected  changes  often  occur  in  dynamical  systems.  These 
changes  may  degrade  the  system  performance.  Such  system  changes  are 
often  termed  "failures".  There  are  several  ways  that  failures  can 
arise.  Actuator  failures  and  sensor  failures  are  a few  examples.  To 
improve  the  performance  of  systems  under  failures,  the  failures  must  be 
promptly  detected,  isolated,  and  accommodated.  We  shall  refer  to  a 
system  which  performs  these  three  tasks  as  a failure  detection  system. 

For  the  past  decade,  a wide  variety  of  techniques  have  been 
proposed  for  designing  failure  detection  systems.  Extensive  surveys  on 
this  subject  can  be  found  in  Willsky  (1976)  and  Isermann  (1984).  To 
design  a failure  detection  system,  the  model  of  the  corresponding 
dynamical  system  must  be  given.  In  some  practical  cases,  uncertainties 
in  system  models  arise.  This  leads  to  the  issue  of  robust  failure 
detection  which  has  been  investigated  by  Leininger  (1981)  , Chow  and 
Willsky  (1984)  and  Lou  et  al . (1986). 

Here,  we  are  concerned  with  a special  type  of  failures,  namely, 
sensor  failures.  Sensor  failures  can  occur  in  several  forms.  For 
example,  sensors  may  become  completely  non- operational  or 
unsatisfactorily  operational.  In  the  former  case,  sensors  simply  stop 
operating.  While  in  the  latter  case  the  measurements  provided  by 
sensors  contain  unexpected  disturbances.  The  degradation  in  sensor 
performance  caused  by  these  disturbances  may  be  in  the  form  of  biases  or 


72 


73 


increased  inaccuracies. 

The  degradation  in  sensor  performance  which  we  are  concerned  here 
is  in  the  form  of  increased  inaccuracies.  As  mentioned  in  Willsky 
(1976),  this  degradation  can  be  modeled  as  abrupt  increases  in 
measurement  noise  covariances.  For  the  degradation  in  the  form  of 
biases,  readers  may  refer  to  Deckart  et  al . (1977)  and  Clark  and  Setzer 
(1980)  for  further  discussions.  The  material  presented  in  this  chapter 
is  a part  of  author's  research  reported  in  Bullock  et  al.  (1987). 


5 . 1 Problem  Statement 

Consider  the  discrete  stochastic  dynamical  system  described  by 

Vi  - *ic\  + GA  + wk  • xo  - N(V  V • wk  ~ N(0>  V 

yk  = \\  + vk  : vk  ~ N(°.  Rk)  with  \ > 0 (5.2) 

where  x^,  u^  , and  y^  denote  the  state,  the  deterministic  input,  and 

the  measurement,  respectively,  v^  and  denote  respectively  the  system 

noise  and  measurement  noise,  assumed  white.  The  x , {w  } , and  {v  } are 

OK  K 

assumed  to  mutually  uncorrelated. 


The  estimate  of  x^  given  {yQ , , . . , y^  ^ can  be  obtained 

from  the  Kalman  filter.  With  the  above  assumptions,  x^^  ^ is  the 

unbiased  minimum  variance  estimate.  Here  we  also  assume  that 
H^)  is  uniformly  detectable  so  that  the  one-step  predictor 
error  covariance  M^,  is  uniformly  bounded  (Anderson  and  Moore,  1981). 
Let  us  now  consider  the  case  that  sensors  used  to  provide  measurements 
can  fail  at  some  unknown  time,  say  k^  . The  effect  of  sensor  failures 
is  assumed  to  be  in  the  form  of  increased  inaccuracies  and  is  modeled  as 


increases  in  measurement  noise  covariances.  With  this  assumption,  the 
situation  is  that  the  filter  has  been  designed  for  measurement  noise 
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>Rk 

for  k > kQ  . Given  measurements  y^,  we  would  like  to  determine  if 

k > k . 
o 

We  shall  develop  a residual -based  detection  scheme  for  the  problem 
stated  above.  It  should  be  noted  that  a standard  voting  scheme  can  also 
be  employed  for  this  problem.  But  the  voting  scheme  requires  at  least 
three  similar  sensors  in  order  to  detect  the  occurrence  of  failures  in 
one  sensor.  Because  of  its  expensiveness  in  implementation,  the  voting 
scheme  is  thus  unattractive . 

5 • 2 The  Effect  of  Sensor  Failures  on  the  Residual  Sequence 

Let  us  now  investigate  the  effect  of  sensor  failures  on  the 
residual  sequence  {y  . 

K I K 

used  in  computing  the  Kalman  filter  gain  K^  instead  of  the  correct 
covariance,  R^  . Consequently,  computed  from  the  Kalman  filter  is 
not  the  actual  one-step  predictor  error  covariance.  It  should  be  noted 

A 

that  the  estimate  x^^  ^ obtained  from  the  Kalman  filter  is  no  longer 
the  minimum  variance  estimate  of  x^.  for  k > kQ  . But  ^ is  still 

an  unbiased  estimate  of  x^  . It  is  important  to  observe  that  even 
though  we  may  not  have  true  divergence,  i.e.,  the  actual  mean- squared 
error  of  the  estimate  becomes  unbounded  as  time  increases,  in  the 
situation  described  above,  it  is  quite  possible  that  the  estimation 
errors  may  become  intolerably  large.  This  phenomenon  which  is  known  as 
apparent  divergence  (Fitzgerald,  1971)  is  equivalent  in  a practical 
sense  to  true  divergence. 

As  discussed  in  Chapter  II,  the  actual  one-step  predictor  error 
convariance  M,  satisfies  the  following  difference  equation. 


. When  the  sensors  have  failed,  It  is  still 


covariance  R^  but  yet  the  actual  measurement  noise  covariance  is  R£ 


75 


Ci  - 

fkCk 

+ vX  + \ : k - k0 

where 

II 

ItT* 

Fk  - 

w 

Also 

"k+l  - 

Wk 

+ Wi  + \ 

Thus 

Ci-  ^i  - 

f(I£- 

' — ✓ 
+ 

^o 

1 

£ 

IV 

!V 

O 

Since  and  , we  then  have  that 

o o 

< a «k  : k a ko 

It  follows  from  (5.6)  and  R^  > R^  that 

cov(yk|k-r  yk|k-i}  = Hk  “k  Hk  + *k  : 0 * k < kc 


> H, 


“k  Hk  + «k  : k * ko 


(5.3) 

(5.4) 

(5.5) 


(5.6) 


(5.7) 


This  suggests  that  determining  if  k > kQ  is  equivalent  to  testing 
if  the  convariance  of  the  residual  sequence  has  increased.  It  should  be 
pointed  out  that  i-s  white  for  0 < k < kQ,  but  it  is  no  longer 
white  for  k > kQ  . However,  {yk|k  rs  a zero -mean  Gaussian  noise 
sequence  for  all  k > 0 with  covariance  satisfying  (5.7). 


5 . 3 Hypothesis  Testing  for  Sensor  Failures 


Since 

(Hk 

+ 

V 

is  symmetric 

positive  definite, 

there 

exists  a nonsingular  matrix 

Tk 

such  that  Tk(Hk  \ + Rk)  T£  - 

I. 

Let  us 

now 

define 

Zk 

: = 

Tk  yk|k-l 

(5.8) 

Then 

Zk 

- 

N(0,  Vk) 

(5.9) 

where 

\ 

= 

i ; 

0 < k < k 

o 

> 

i ; 

k > k 
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Since  {yk|k.1)  is  white  for  0 < k < kQ  , Zq  , zk  -1  are 

o 

independent.  Without  sensor  failures,  one  can  therefore  consider  z^  as 

independent  observations  of  a random  vector  z normally  distributed  with 

zero  mean  and  identity  covariance.  The  hypothesis  test  for  sensor 

failures  can  be  formulated  as  follows . Given  n observations  z of  a 

k 

random  vector  z normally  distributed  with  zero  mean  and  unknown 
covariance  V.  We  perform  the  following  multivariate  hypothesis  test. 

Hq  : sensors  have  not  failed  V = I 

„ (5.10) 

: sensors  have  failed  V > I 

To  obtain  a test  statistic  for  the  above  one-sided  hypothesis 
test,  we  shall  use  Roy's  union- intersection  principle  (Roy,  1953).  The 
union- intersection  principle  is  widely  used  in  the  area  of  simultaneous 
test  procedures  (Krishnaiah,  1979).  However,  it  seems  to  be  little 
known  to  engineers.  The  ideas  of  the  union- intersection  principle  can 
be  briefly  stated  as  follows.  Given  a multivariate  hypothesis  H,  we 
decompose  H into  a union  or  an  intersection  of  several  univariate 
hypotheses.  For  each  univariate  hypothesis,  we  develop  a test  statistic 
and  obtain  accordingly  an  acceptance  region  and  a critical  region.  The 
acceptance  region  and  the  critical  region  of  H are  then  obtained  by 
appropriately  intersecting  or  unioning  all  acceptance  regions  and 
critical  regions  of  the  univariate  hypotheses.  As  mentioned  in  Roy 
(1957) , if  the  test  for  each  univariate  hypothesis  has  certain  optimum 
properties,  then  the  test  for  H obtained  by  the  above  procedure  will 
have  some  reasonably  good  properties . 

Now  observe  that  V = I iff  a'  V a - a' a for  all  a in  RP  , 
where  p denotes  the  dimension  of  z.  An  equivalent  hypothesis  test  to 
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(5.10)  can  then  be  given  by 


H 

o 

a'  V a 

a'a 

for 

all 

a in 

Hi  : 

a'  V a > 

a'a 

for 

some 

a in 

Let  us 

next  consider  a 

subhypothesis 

test 

for  a 

(5.11) 


fixed  non- zero  a 


in  IT 


Ha  : a'  V a = a' a 

o 

Ha  : a'  V a > a' a 


(5.12) 


Consider  a test  statistic  t(a) 


a'  A a 
a'a 


(5.13) 


where 


n - 1 

A :=  2 z.  z'.  (5.14) 

i = 0 1 1 v ' 


It  can  be  shown  (see  Appendix  D)  that  t(a)  has  a chi-squared 

distribution  with  n degrees  of  freedom  if  z - N(0,I).  We  may  reject 
£ 

Hq  if  t(a)  > tQ  where  tQ  is  some  positive  number.  It  should  be  noted 
that  the  test  based  on  this  critical  region  is  a uniformly  most  powerful 
test  for  given  independent  observations  (see  Appendix  D) . 

Notice  that 


H 

o 


n Hc 


and 


H, 


U HI 


a^o  “ " a^o  A 

This  suggests  that  we  accept  H if  t(a)  < t for  all  non-zero  a in 

o o 

or  equivalently  m^x  t(a)  < tQ  , and  we  reject  Hq  otherwise.  Using 
the  symmetric  property  of  A,  one  can  show  that  (Noble  and  Daniel,  1977): 


R*\ 


max  t(a) 
afo 


max 

afo 


a'  A a 
a'a 


A (A) 
max 


(5.15) 


where  A^  (A)  denotes  the  largest  eigenvalue  of  A. 
used  as  a test  statistic  for  (5.10). 


Thus  A (A)  may  be 
max  J 
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The  random  matrix  A defined  in  (5.14)  is  well  known  to  be 
distributed  according  to  the  Wishart  distribution  W(I,n)  under  the  null 
hypothesis  Hq  (Anderson,  1984).  The  distribution  of  the  largest 
eigenvalue  of  A has  a density  function  of  a certain  form  depending  on  p 
and  n.  The  exact  form  of  the  density  function  can  be  found  in 
Krishnaiah  and  Chang  (1971). 
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44  45 

46.74 

48.95 

1 1 
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20 

43.08 
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Table  5.1:  Upper  Significance  Points  of  A (A) 

max 

The  approximate  values  of  certain  upper  significance  points  of 

Amax^  obtained  from  Pearson  and  Hartley  (1972)  are  given  in  Table  5.1. 

Let  A be  number  such  that  Pr{ A (A)  > A } = a under  H . For  example, 
w max  ot  o r 

if  P = 2,  n = 10,  and  a = 0.01,  then  from  Table  5.1  A = 

a 


27.63.  We 
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then  reject  H and  accept  H..  with  significance  level  a if  A (A)  is 
° 1 max 

greater  or  equal  to  A 

a 

So  far,  we  have  shown  that  the  largest  eigenvalue  of  random  matrix 

A = n.S,  where  S is  the  sample  covariance  matrix  based  on  n samples  of 

normalized  residual  sequence,  may  be  used  as  a test  statistic  in 

detecting  sensor  failures.  However,  a chi -squared  test,  which  was 

suggested  in  Mehra  and  Peschon  (1971)  and  applied  by  Willsky  et  al. 

(1975)  to  detect  the  occurrence  of  system  abnormalities,  can  also  be 

used  to  detect  abrupt  increases  in  measurement  noise  covariances.  The 

chi -squared  test  is  based  on  the  trace  of  A.  Under  the  null  hypothesis 

V t-r(A)  has  a chi-squared  distribution  with  n.p  degrees  of  freedom. 

Let  us  now  investigate  the  relationship  between  test  statistics  A (A) 

max 

and  tr (A) . 

Since  A is  symmetric,  there  exists  an  orthogonal  matrix  T such 

that 


T'  A T 

= 

D 

where  D is  diagonal. 

Now  let  a 

: = 

([11. 

. 1]  T'  )' 

(5.16) 

Then  tr(A)  = a'  A a . 

= p ■ a'  A a since  a' a = p. 

a'  a 

Consequently,  tr(A)  = p.t(i)  (5.17) 

It  should  be  noted  that  a is  itself  a random  vector  since  T is  a 

random  matrix  depending  on  A.  As  far  as  the  constant  p is  concerned, 

hypothesis  tests  based  on  p.t(a)  and  t(a)  are  equivalent.  From  (5.17), 

we  can  see  that  when  we  use  tr(A)  as  a test  statistic  for  (5.11),  we  are 

restricting  our  attention  to  subhypothesis  test  Ha  against  Ha  with 

o ° 1 
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a - a . But  in  developing  test  statistic  A (A)  for  (5.11),  we  have 

max 

considered  subhypothesis  tests  H*  against  H*  for  all  nonzero  a in  RP, 
which  of  course  include  the  subhypothesis  test  H*  against  . In  this 
sense,  test  statistic  \__„(A)  is  more  general  than  test  statistic  tr(A) 
is . 


5 • 4 A Scheme  for  Detecting  Sensor  Failures 

Let  us  now  develop  a scheme  for  detecting  sensor  failures.  Given 
lYklk-l1  ’ n now  denote  the  value  of  the  window  length  used  in  the 
detection  scheme,  and  let  k denote  time  index  when  the  detection  scheme 
is  performed  with  k > n-1. 


Define 


Ak 

where  is  defined  in  (5.8). 


2 z.  zl 
i -k -n+ 1 1 1 


:=  A 


max 


(V 


(5.18) 

(5.19) 


Figure  5.1:  Sensor  Failure  Detection. 
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Let  a be  the  significance  level  used  in  the  hypothesis  test,  and 
Xa  be  the  threshold  corresponding  to  a.  The  detection  scheme  is  that  if 
> we  decide  that  the  sensors  have  not  failed  up  to  time  k,  or 
else  we  decide  that  the  sensors  have  failed  at  some  particular  time 
before  or  at  time  k.  A block  diagram  illustrating  the  detection  scheme 
is  depicted  in  Fig.  5.1. 

After  sensor  failures  are  detected,  one  needs  to  adjust  the 
filter  gain  to  accommodate  the  failures.  Here,  we  propose  a simple 

scheme  for  the  adjustment.  The  idea  is  to  obtain  the  estimate  of 

C0V^yk|k-l’yk|k-l^  and  then  adJust  the  filter  gain  accordingly. 

Based  on  the  assumption  that  COH^y^^  l^^lk  1^  constant  over  N 

consecutive  samples,  the  sample  covariance  of  y.  , , denoted  by  S 

k|k-l  J k’ 

is  computed  and  used  as  an  estimate  of  COX^y^^  i^^lk  1^'  Assume 
further  that  (M^  — M^)  is  not  considerably  large  when  the  failures  are 
detected.  The  filter  gain  may  then  be  given  by 

„-l 


Fk  “k  Hk  Sk 


where 


N i-k-N+i  yk|k-l  yk|k-l 


(5.20) 

(5.21) 


It  should  be  noted  that  adjusting  according  to  (5.20)  is 
equivalent  to  adjusting  according  to  (5.22),  where  r£  may  be 
interpreted  as  an  estimate  of  the  measurement  noise  covariance. 


‘He 


Fk  \ “i  <Wic  - 


(5.22) 


where  r£  Sk  - ^ H’  (5.23) 

It  is  important  to  observe  that  when  is  computed  from  (5.20), 
the  filter  error  covariance  , which  is  approximated  by  (5.24),  may 

not  be  positive  semi-definite. 
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.-1 


(5.24) 


pk  - "k-  "k^k  Ht\ 

A sufficient  condition  for  P^  being  positive  semi-definite  is 
that  (S^  — > 0.  This  condition  does  not  always  hold  since  is 

only  the  sample  covariance  of  . But  from  (5.7),  we  know  that 

C0V<^k|k-l'^k|k-l)  > Hk"kHk  + \ for  k>kQ. 

The  above  information  about  the  covariance  of  y,  „ should  also 

vk|k-l 

be  utilized  in  adjusting  the  filter  gain  K,  . Thus,  we  may  simply 
replace  by  HjMjH^  + when  (Sk  - H^H^.  - 1^)  is  not  positive 

semi-definite.  By  doing  this,  P^  is  guaranteed  to  be  positive  semi- 
definite  at  the  same  time.  The  sensor  failure  detection  system 
described  above  has  been  reported  by  Bullock  et  al.  (1987)  based  on 
their  Monte-Carlo  study  to  work  satisfactorily. 


It  should  be  noted  that  sensor  failures  may  also  be  accommodated 
by  simply  turning  off  the  failed  sensors.  But  this  may  make  the  system 
become  unobservable.  Once  the  system  is  unobservable,  the  state  error 
covariance  may  become  unbounded  as  time  increases.  In  particular,  if 
the  state  transition  matrix  of  the  system  is  not  exponentially  stable, 
the  state  error  covariance  can  be  intolerably  large  within  a short 
period  of  time  after  the  system  becomes  unobservable. 


5 . 5 Concluding  Remarks 

Abrupt  increases  in  measurement  noise  covariances  due  to  sensor 
failures  in  the  form  of  increased  inaccuracies  have  been  considered.  It 
has  been  shown  that  this  type  of  sensor  failures  affects  the  residual 
sequence  by  an  unknown  increment  in  its  covariances.  Consequently,  the 
sensor  failures  can  be  detected  by  simply  testing  whether  or  not  the 
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covariance  of  the  residual  sequence  has  increased.  Utilizing  Roy's 
un ion- intersection  principle,  we  have  shown  that  the  test  statistic 
which  can  be  used  in  the  detection  is  the  largest  eigenvalue  of  random 
matfix  A = n.S,  where  S denotes  the  sample  covariance  matrix  based  on  n 
samples  of  the  normalized  residual  sequence. 

The  sensor  failure  detection  developed  in  this  chapter  can  be 
directly  applied  to  formulate  an  adaptive  state  estimation  scheme  which 
is  robust  against  sensor  failures.  In  state  estimation,  one  can  improve 
the  robustness  against  sensor  failures  by  simply  using  several  different 
types  of  sensors.  Since  the  sensors  are  implemented  with  dissimilar 
instruments,  they  are  unlikely  to  fail  at  the  same  time.  For  instance, 
an  active  radar  seeker  and  an  imaging  infrared  seeker  are  usually  used 
in  target  tracking.  However,  failure  in  one  sensor  can  deteriorate  the 
overall  state  estimate.  For  this  reason,  one  might  want  to  use  sensor 
failure  detection  to  identify  the  failed  sensor  and  accommodate  the 
failure  accordingly. 


CHAPTER  VI 
CONCLUSIONS 


This  dissertation  has  analyzed  Kalman  filtering  under  uncertainty 
in  noise  covariances.  Particular  emphasis  has  been  given  to  discrete- 
time Kalman  filtering.  The  dissertation  consists  of  three  main  parts. 

In  the  first  part,  we  considered  situations  in  which  the  Kalman 
filter  is  designed  by  using  incorrect  noise  covariances.  Behavior  of 
the  Kalman  filter  under  incorrect  noise  covariances  was  analyzed. 
Particularly,  we  were  interested  in  the  characteristic  of  the  actual 
performance  of  the  filter.  The  filter  performance  was  quantified  by  the 
actual  error  covariance  of  the  state  estimate.  Through  this  quantity, 
the  characteristic  of  the  filter  was  studied.  In  particular,  two 
important  properties  of  the  actual  error  covariance,  convergence  and 
divergence,  were  investigated.  The  contribution  of  the  results 
developed  in  this  part  is  that  they  help  one  to  understand  and  be  able 
to  predict  certain  behavior  of  the  Kalman  filter  when  inexact  values  of 
noise  covariances  are  used.  This  is,  of  course,  important  since  the 
exact  values  of  the  noise  covariances  are  hardly  known  in  most  practical 
cases . 

In  the  second  part,  situations  in  which  the  noise  convariance  are 
unknown  were  considered.  A direct  technique  for  estimating  the  unknown 
noise  covariances  which  was  referred  to  as  the  SPMC  technique  was 
discussed  and  analyzed.  The  technique  is  direct  in  the  sense  that  the 
unknown  noise  covariance  can  be  estimated  without  requiring  the  estimate 


84 


85 


of  the  state  and  the  stationarity  of  the  measurements.  A sufficient 
condition  for  which  the  estimates  of  the  noise  covariances  converge  in 
quadratic  mean  to  their  actual  value  was  established.  In  addition, 
discussions  on  unique  determinability  of  the  unknown  noise  covariances 
were  given. 

In  the  third  part,  we  considered  cases  in  which  the  noise 
covariances  are  subject  to  abrupt  changes.  Particularly,  we  were 
interested  in  abrupt  increases  in  measurement  noise  covariances  which 
were  used  to  model  the  effect  of  sensor  failures  in  the  form  of 
increased  inaccuracies.  It  was  shown  that  this  type  of  sensors  failures 
affected  the  residual  sequence  by  an  unknown  increment  in  its 
covariance.  The  sensor  failures  can  therefore  be  detected  by  simply 
testing  whether  or  not  the  covariance  of  the  residual  sequence  has 
increased.  It  was  shown  that  the  test  statistic  which  could  be  used  in 
the  detection  was  the  largest  eigenvalue  of  random  matrix  A = n.S  where 
S denotes  the  sample  covariance  matrix  based  on  n samples  of  the 
normalized  residual  sequence.  A scheme  for  detecting  abrupt  increases 
in  measurement  noise  covariances  was  then  developed  in  this  part. 


APPENDIX  A 

STABILITY  OF  LINEAR  SLOWLY  TIME -VARYING  SYSTEMS 


In  this  appendix,  we  briefly  discuss  stability  of  linear  slowly 
time -varying  systems.  First,  let  us  consider  the  continuous- time  system 
described  by 

x = A(t)  x (A . 1) 

It  is  well  known  that  stability  of  the  above  time -varying  system, 
in  general,  cannot  be  concluded  from  stability  of  the  corresponding 
frozen-time  systems.  To  see  this,  we  consider  the  following  two 
examples . 


by 


Example  A.l  (Vidyasagar,  1978):  Let  A(t)  be  given  by 

2 


A(t) 


— 1 + a. cos  t 

-1  — a. sin  t.cos  t 


1 — a. sin  t.cos  t 
2 

— 1 + a. sin  t 


Then,  the  corresponding  state  transition  matrix  $(t,0)  is  given 


*(t,0) 


— e 


(a-l)t 

cos  t 

-t  . 

e sm  t 

(a-l)t  . 

sm  t 

-t 

e cos  t 

Observe  that  the  eigenvalues  of  A(t)  are  independent  of  t and 
satisfy  the  following  characteristic  equation. 

2 

s + (2  - a)s  + (2  - a)  = 0 

Now,  if  a < 2,  then  A(t)  has  all  its  eigenvalues  with  negative 
real  parts  for  all  t.  It  is  however  clear  from  the  above  expression  of 
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$(t,0)  that  the  system  (A.l)  is  unstable  for  all  a > 1.  This  shows  that 
stability  of  the  system  (A.l),  in  general,  cannot  be  predicted  from 
stability  of  the  frozen-time  systems,  i.e.,  from  the  eigenvalues  of  A(t) . 


as 


Example  A. 2 (Skoog  and  Lau,  1972):  Let  A(t)  be  given  by 


A(t) 

The  corresponding 


— 2—i 

-1  + a. sin  t.cos  t 1 + a. cos  t 
2 

—1  — a. sin  t —1  — a. sin  t.cos  t 

state  transition  matrix  $(t,0)  can  be  expressed 


$(t,0) 


-t 

1 cos  t 

e t(sin  t + a. t.cos  t) 

-t  . 
s sin  t 

-t . . 

e (cos  t — a.t.sm  t) 

It  can  be  shown  that  the  eigenvalues  of  A(t)  are  independent  of  t 
and  satisfy  the  following  characteristic  equation. 

2 

s + 2s  + (2  + a)  = 0 


For,  a < -2,  A(t)  has  an  eigenvalue  with  a positive  real  part. 
But  yet,  the  system  (A.l)  is  exponentially  stable  for  all  a.  Thus,  this 
example  shows  that  instability  of  the  system  (A.l),  in  general,  can 
neither  be  predicted  from  the  eigenvalues  of  A(t) . 

In  case  that  the  variation  of  A(t)  is  sufficiently  slow,  stability 
of  the  system  (A.l)  can  however  be  predicted  from  the  eigenvalues  of 
A(t).  This  fact  was  first  pointed  out  by  Rosenbrock  (1963).  The 
following  theorem  which  has  been  used  in  Chapter  3 is  due  to  Desoer 
(1969)  . 

Theorem  A.l:  Let  A(t)  be  differentiable  for  all  t > 0.  Suppose 

that 

(i)  A(t)  is  uniformly  bounded,  and 

(ii)  A(t)  has  all  its  eigenvalues  with  real  parts  less  than  or 
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equal  to  -a  V t > 0,  for  some  positive  number  a . 

Then,  there  exists  a positive  number  e such  that  if  sug  || A ( t ) ||  < e , 
the  system  (A.l)  is  exponentially  stable. 

For  discrete- time  system 

*k+l  \ \ (A- 2) 

Desoer  (1970)  has  proven  the  following  theorem  which  is  the 
discrete- time  counterpart  of  Theorem  A.l. 

Theorem  A. 2:  Suppose  that 

(i)  A^  is  uniformly  bounded,  and 

(ii)  A^  has  all  its  eigenvalues  inside  or  on  the  circle,  centered 
at  the  origin,  of  radius  p V k > 0,  for  some  p 6 (0,1). 

Then,  there  exists  a positive  number  e such  that  if  sug  ||a^  A^|| 

the  system  (A. 2)  is  exponentially  stable. 


< £ 


APPENDIX  B 

CONVERGENCE  PROPERTY  OF 
SOLUTIONS  OF  THE  RICCATI  DIFFERENCE  EQUATION 


In  this  appendix,  we  shall  prove  the  convergence  property  of 
solutions  of  the  RDE  stated  in  the  proof  of  Corollary  2.3.  Let  us  now 
consider  the  RDE  given  by  (2.3)  with  constant  F,  H,  Q,  and  R.  The 
following  lemma  demonstrates  how  solutions  of  the  RDE  with  different 
initial  values  are  related. 


Lemma  B;  Let  ^ and  2 be  solutions  of  the  RDE  with  different 
initial  values.  Then, 


k+1 


Fk,l  Ak  Fk, 2 


where 

with 


A "k,2  Fk,j  F-Kk.JH 

"kj  FHk,jH'<mkjH’  + R)-1,  for  j - 1,2. 

Proof:  Observe  that 


(B . 1) 


\+l,l 
Thus , A. 


= F. 


k,lMk.lF'  + Q 


and 


“k- 


+1,2 


FMk 


F' 

,2rk,2 


+ Q . 


k+l  - fk,lMk,1F'  - FMk,2Fk,2 


Pk,l\T'k,2  + ?k,lMk,1H'Kk,2  - \,l™k,2fi,2 


It  can  be  shown  that  the  sum  of  the  last  two  terms  on  the  RHS  of 
the  above  equation  is  equal  to  zero.  Equation  (B.l)  therefore  holds.  ■ 

Let  us  now  prove  the  following  convergence  property  of  solutions 
of  the  RDE. 
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Theorem  B:  Let  (F,H)  be  detectable  and  (F,Q^)  have  no  unreachable 
mode  on  the  unit  circle.  Suppose  that  the  solution  of  the  RDE  with 
initial  value  II  > 0 converges  to  the  stabilizing  solution  M of  the  ARE. 
Then,  the  solution  of  the  RDE  with  initial  value  a. II  for  any  positive 
number  a also  converges  to  M. 


Proof . First  notice  that  the  existence  of  the  stabilizing  solution 
M is  guaranteed  by  the  above  assumptions  on  F,H,  and  Q.  Let  ^ and 
2 solutions  of  the  RDE  with  initial  values  II  and  a. II 

respectively.  From  Lemma  B,  we  have  that 


Hence , 


k+1 


Ak  - (1  - a)$1(k,0)n  *'(k,0) 


(B . 2) 


where 


»j(k,i) 


denotes  the  state  transition  matrix  associated  with 


Fk,j  for  J 


1.2. 


But  **k  2 ~ Q>  + n°nnegative  definite  terms. 

Hence  > a.  *2(k,0)II  «£(k,0)  (B.3) 

Because  of  the  detectability  of  (F,H),  2 is  uniformly  bounded. 

It  then  follows  from  (B.3)  that  II  ^(k,!))  is  uniformly  bounded.  But 

from  Remark  2.4,  we  have  that  F^  ^ is  exponentially  stable.  It  is  now 

clear  from  (B.2)  that  lim  A.  =0.  Therefore,  lim  M,  = M ■ 

k -+«  k k Tc , 2 


Remark  B:  As  stated  in  Remark  2.5,  the  solution  of  the  RDE  with 
initial  value  II  > M,  where  M denotes  the  stabilizing  solution  of  the 
ARE,  converges  to  M.  Based  on  this  fact  and  the  above  theorem,  one  can 
conclude  that  the  solution  of  the  RDE  with  initial  value  II  such  that 
a.II  > M for  some  positive  number  a also  converges  to  M. 


APPENDIX  C 

CONVERGENCE  PROPERTY  OF 

SOLUTIONS  OF  THE  RICCATI  DIFFERENTIAL  EQUATION 


Consider  the  RDE  given  by  (3.5)  with  constant  F,H,Q,  and  R.  In 
this  appendix,  we  shall  prove  two  convergence  properties  of  solutions  of 
the  RDE,  which  have  been  stated  in  Section  3.2.  First,  let  us  prove  the 
following  lemma  which  demonstrates  how  solutions  of  the  RDE  with 
different  initial  values  are  related. 


Lemma C:  Let  P^(t)  and  P2(t)  be  solutions  of  the  RDE  with 


different  initial  values.  Then, 

A(t)  = F1(t)A(t)  + A(t)F£(t) 


(C.l) 


where 

and 


A(t)  :=  P1(t)  - P2(t) 


-1 


Fi(t)  :=  F - Ki(t)H  with  K :=  Pi(t)H'R  A,  for  i = 1,2 


Proof:  A(t)  = F1(t)P1(t)  + P1(t)F[(t)  + K1(t)RK^(t)  + Q 

- F2(t)P2(t)  - P2(t)F'(t)  - K2(t)RK^(t)  - Q 

F1(t)A(t)  + A(t)F'(t) 

+ P1(t)F'(t)  - P1(t)F'(t)  + K1(t)RK|(t) 

+ F1(t)P2(t)  - F2(t)P2(t)  - K2(t)RK' (t) 

It  can  be  shown  that  the  sum  of  the  last  six  terms  on  RHS  of  the 
above  equation  is  equal  to  zero.  Thus,  A(t)  satisfies  (C.l).  ■ 

The  convergence  properties  of  solutions  of  the  RDE  are  given  by 
the  following  two  theorems. 
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Theorem  C 1 : Let  (F,H)  be  detectable  and  (F,  Q1*)  have  no 
uncontrollable  mode  on  the  imaginary  axis.  If  P(0)  > 0,  then  P(t) 
converges  to  the  unique  stabilizing  solution  P of  the  ARE. 

Proof:  Under  the  above  assumptions  on  F,H,  and  Q,  the  stabilizing 
solution  P of  the  ARE  exists  and  is  unique.  Using  Lemma  C,  we  have  that 


(C.2) 


A(t)  = F(t)A(t)  + A(t)F' 
where  A(t)  :=  P(t)  - p 

Hence>  A(t)  = *(t,0)(P(0)  - P)  (C  3) 

where  tf(t,s)  is  the  state  transition  matrix  associated  with  F(t) . 


But  P(t) 


*(t,0)P(0)«'(t,0)  + 


t 

f *(t,s)[K(s)RK'(s)  + Q]flr'(t,s)ds 
o 


Thus  P(t)  > *(t,0)P(0)*'(t,0)  (r  , x 

Since  (F.H)  is  detectable,  P(t)  is  uniformly  bounded.  Because 
of  uniform  boundedness  of  P(t)  and  positive  definiteness  of  P(0)  , 
inequality  (C.4)  implies  that  t(t,0)  is  uniformly  bounded.  Using 
uniform  boundedness  of  *(t,0)  and  exponential  stability  of  F,  we  obtain 

from  (C • 3)  that  A(t)  converges  to  a zero  matrix.  Consequently, 
lim  P(t)  = P. 


Theorem  C.2:  Let  (F,H)  be  detectable  and  (F.Q1*)  have  no 
uncontrollable  mode  on  the  imaginary  axis.  Suppose  that  the  solution  of 
the  RDE  with  initial  value  n > 0 converges  to  the  unique  stabilizing 
solution  P of  the  ARE.  Then  the  solution  of  the  RDE  with  initial  value 
a.n  for  any  positive  number  a also  converges  to  P. 


Proof:  Let  P^t)  and  P2(t)  be  solutions  of  the  RDE  with  initial 
values  II  and  a. II  , respectively.  By  Lemma  C, 
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A(t) 

F1(t)A(t)  + A(t)F'(t) 

Thus 

A(t) 

(1  - a).*1(t,0)n  *^(t,0) 

(C.5) 

where  * . (t,s)  denotes  the  state  transition  matrix  associated  with  F 

i 

for  i = 1,  2. 

Using  the  same  arguments  given  in  the  proof  of  Theorem  C.l,  we 
have  that  ^(t.O)  II  ^^(t,0)  is  uniformly  bounded.  This  in  turn  implies 
that  n ^2 (t 1 0)  is  uniformly  bounded.  From  Remark  3.5,  we  have  that 

^(t.O)  is  exponentially  stable.  It  is  then  clear  from  (C.5)  that 
ii™  A(t)  — 0.  Thus,  P2(t)also  converges  to  P.  ■ 


APPENDIX  D 

DISTRIBUTION  AND  PROPERTY  OF  TEST  STATISTIC  t(a) 


Let  z^,  z^  be  n independent  observations  of  a p- dimen- 
sional random  vector  z - N(0,I).  Let  us  first  prove  that  t(a)  defined 
below  has  a chi-squared  distribution  with  n degrees  of  freedom. 


t(a) 


a'  A a 
a'  a 


where 


: — i?i  zi  Zi  an<^  a is  a nonzero  vector  in  R . 


Let  us  state  the  following  two  theorems  (Lancaster,  1969)  needed 
for  completing  the  proof. 

Theorem — D_._  1 : Let  x and  y be  independent  random  variables 

distributed  according  to  chi-squared  distributions  with  k and  m degrees 
freedom,  respectively.  Then  x + y has  a chi-squared  distribution 
with  k + m degrees  of  freedom. 


Theorem  D.2:  Let  p-dimensional  random  vector  x - N(0,I),  B be  a 
pxp  symmetric  matrix,  and  m < p.  Then  x'  B x has  a chi-squared 
distribution  with  m degrees  of  freedom  iff  B is  idempotent  of  rank  m. 


Let  us  now  prove  that  t(a)  has  a chi-squared  distribution  with  n 
degrees  of  freedom.  Based  on  Theorem  D.l,  it  is  sufficient  to  show  that 
a'zz'a  / a' a has  a chi-squared  distribution  with  one  degree  of  freedom. 


From 


a'zz1  a 


a'  a 


z'aa'z 

a'a 


= z'  B z where  B := 


a'a 


aa'  . 
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It  can  be  observed  that  B is  idempotent  of  rank  1.  By  Theorem 
D.2  , a'zz'a  / a'a  has  a chi-squared  distribution  with  one  degree  of 

n 

freedom.  Consequently,  t(a)  :=  ^ a'z^ja  / a'a  has  a chi-squared 
distribution  with  n degrees  of  freedom. 

Given  z^,  z^  independent  observations  of  random  vector  z 

normally  distributed  with  mean  zero  and  unknown  covariance  V.  It  is 
however  known  a priori  that  either  V = I or  V > I holds.  Next,  we  shall 
show  that  for  a subhypothesis  test  with  fixed  non-zero  vector  a : 


Ha  : 

o 

a'  V a 

= 

a'a 

..a 

Hi  : 

a'  V a 

> 

a'a 

the  test  based  on  a critical  region  { t(a)  > t^}  is  a uniformly  most 
powerful  test  with  significance  level  a , where  t(a)  is  defined  above 
and  tQ  is  a number  such  that  Pr{  t(a)  > t } = a . Roughly,  this  means 
that  the  probability  of  rejecting  the  null  hypothesis  when  it  is  false 
of  this  test  is  greater  than  or  equal  to  that  of  any  test  with 
significance  level  less  then  or  equal  to  a . The  exact  definition  of  a 
uniformly  most  powerful  test  with  significance  level  a can  be  found  in 
Mood  et  al . (1974)  or  Lehmann  (1986).  To  prove  the  above  statement,  we 
need  the  following  result  (Mood  et  al . , 1974). 


Theorem  D.3:  Let  x^,...,x  be  independent  observations  of  random 
variable  x whose  density  function  is  given  by 

Px(*;*)  = a(0) .b(x) .exp[c(0) .d(x) ] for  some  parameter  6. 

n 

Let  t(x^, . . . ,xn)  :=  ^2^  d(x^) . If  c(0 ) is  a monotone,  increasing 

function  in  6 and  there  exists  t*  such  that 
Pr { t(x1(...,xn)  > t*  | 9 = 9 ) = a , 
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then  the  test  based  on  critical  region  { t(x1 x ) > t*}  is  a 

1 n 

uniformly  most  powerful  test  with  significance  level  a of 


H : 8 


o 

Now  let 


8 against 
x 


8 

o 


H • 8 > 8 . 

1 o 

:=  a'zz'a  / a'a 
:=  a'  V a 
:=  a'a 


Then 


8 


w 


where  w 


Since  V is  symmetric  positive  definite, 
invertible  matrix  T such  that  T V T'  -I.  Let  y :=  Tz 
Since  T is  invertible,  there  is  a non-zero  b such  that 


a'zz'a 
a'  V a • 

there  exists  an 
then  y - N(0, I) . 
a = T'b. 


Hence , 


a'zz'a 
a'  V a 


b'Tzz'T'b 

b'TVT'b 


b' w'b 
b'b 


But  b'yy'b/  b'b  has  a chi-squared  distribution  with  one  degree  of 
freedom.  Thus  v has  a chi -squared  distribution  with  one  degree  of 
freedom,  and  its  density  function  is  given  by 


exp [ -w/2 ]/(2n  w)^  , w > 0 


pw(w> 

< 

. o 

w < 0 

But 

PX(X) 

= 

ffr—l,  N . 1 df  1(X)  1 

Pw(f  (x>)-  1 dx  1 where 

f(w)  :=  w . 

Thus 

PX(X) 

= 

a(0) .b(x) . exp[c(0) .d(x)] 

o 

where 

a(«) 

: = 

1/(2^  / eQ)h,  b(x)  : = 

-h 
x , 

c(») 

-(V  0)/  2 , and  d(x)  : = 

X. 

It  is 

clear 

that 

c(8)  is  a monotone,  increasing 

function  in  9 , 

and  t(a) 

n 

£ 

i = 1 

d(xt) 

By  Theorem  D.3,  the  test  with 

critical  region 

{ t(a)  > tQ  } is  a uniformly  most  powerful  test  with  significance  level 
a of 


o 


a'a  against 


a'  V a 


a'  V a > a'a  . 


REFERENCES 


Aasnaes , H.B.  and  T.  Kailath  (1974).  "Initial-condition  robustness  of 
linear  least  squares  filtering  algorithms,"  IEEE  Trans.  Automat. 
Contr. , vol.  AC-19,  pp.  393-397. 

Alspach,  D.L  (1972).  "Comments  on:  On  the  identification  of  variances 
and  adaptive  Kalman  filtering,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC- 
17,  pp.  843-845. 

Anderson,  B.D.O.  (1971a).  "Stability  properties  of  Kalman-Bucy 
filters,"  J.  Franklin  Inst.,  vol  291,  pp.  137-144. 

Anderson,  B.O.D.  and  J.B.  Moore  (1979).  Optimal  Filtering,  Prentice- 
Hall,  Englewood  Cliffs,  New  Jersey,  1979. 

Anderson,  B.O.D.  and  J.B.  Moore  (1981).  "Detectability  and 
stabilizability  of  time  varying  discrete  time  linear  systems,"  SIAM  J. 
Contr.  Optimiz.,  vol.  19,  pp.  20-32. 

Anderson,  T.W.  (1971).  The  Statistical  Analysis  of  Time  Series,  John 
Wiley  & Sons,  New  York. 

Anderson,  T.W.  (1984).  An  Introduction  to  Multivariate  Statistical 
Analysis,  John  Wiley  & Sons,  New  York. 

Aoki,  M.  (1987).  State  Space  Modeling  of  Time  Series,  Springer-Verlag, 
Heidelberg,  Germany. 

Belanger,  P.R.  (1974).  "Estimation  of  noise  covariance  matrices  for  a 
linear  time -varying  stochastic  process,"  Automatica,  vol.  10,  pp . 267- 
275. 

Box,  G.E.P.  and  G.M.  Jenkins  (1976).  Time  Series  Analysis:  Forecasting 
and  Control,  Holden-Day,  Oakland,  California. 

Brewer,  J.W.  (1978).  "Kronecker  products  and  matrix  calculus  in  system 
theory,"  IEEE  Trans.  Circuits  Syst.,  vol.  CAS-25,  pp.  772-781. 

Brown,  R.J.  and  A.P.  Sage  (1971a).  "Error  analysis  of  modeling  and  bias 
errors  in  continuous  time  state  estimation,"  Automatica,  vol.  7,  pp. 
577-590. 

Brown,  R.J.  and  A.P.  Sage  (1971b).  "Analysis  of  modeling  and  bias  errors 
in  discrete-time  state  estimation,"  IEEE  Trans.  Aerosp.  Electron. 
Syst.,  vol.  AES-7,  pp.  340-354. 


97 


98 


Bullock,  T.E.,  S.  Sangsuk- lam,  and  T.  Yoon  (1987).  "Sensor  fusion 
study:  Modification  to  experimental  study  of  recent  generalization  of 
optical  matched  filtering,"  Final  Report,  Air  Force  Armament 

Laboratory,  Elgin  Air  Force  Base,  Florida. 

Burg,  J.P.,  D.G.  Luenberger,  and  D.L.  Wenger  (1982).  "Estimation  of 
structured  covariance  matrices,"  Proc.  IEEE,  vol.  70,  pp.  963-974. 

Caines,  P.E.  and  D.Q.  Mayne  (1970).  "On  the  discrete  time  matrix 

Riccati  equation  of  optimal  control,"  Int.  J.  Contr. , vol.  12,  pp. 

785-794. 

Caines,  P.E.  and  D.Q.  Mayne  (1971).  "On  the  discrete  time  matrix 

Riccati  equation  of  optimal  control--a  correction,"  Int.  J.  Contr., 
vol.  14,  pp.  205-207. 

Callier,  F.M.  and  J.L.  Willems  (1981).  "Criterion  for  convergence  of 
the  solution  of  the  Riccati  differential  equation,"  IEEE  Trans. 
Automat.  Contr.,  vol.  AC-26,  pp.  1232-1242. 

Carew,  B.  and  P.R.  Belanger  (1973).  "Identification  of  optimum  filter 
steady-state  gain  for  systems  with  unknown  noise  covariances,"  IEEE 
Trans.  Automat.  Contr.,  vol.  AC-18,  pp.  582-587. 

Chan,  S.W.,  G.C.  Goodwin,  and  K.S.  Sin  (1984).  "Convergence  properties 
of  the  Riccati  difference  equation  in  optimal  filtering  of 
nonstabilizable  systems,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-29,  pp. 
110-118 

Chow,  E.Y.  and  A.S.  Willsky  (1984).  "Analytical  redundancy  and  the 
design  of  robust  failure  detection  systems,"  IEEE  Trans.  Automat. 
Contr.,  vol.  AC-29,  pp.  603-614. 

Chung,  K.L.  (1974).  A Course  in  Probability  Theory,  Academic  Press,  New 
York. 

Clark,  R.N.  and  W.  Setzer  (1980)  . "Sensor  fault  detection  in  a system 
with  random  disturbances,"  IEEE  Trans.  Aerosp.  Electron.  Syst. , vol. 
AES-16,  pp.  468-473. 

Deckert,  J.C.,  D.N.  Mukund,  J.J.  Deyst,  and  A.S.  Willsky  (1977).  "F-8 

DFBW  sensor  failure  identification  using  analytic  redundancy,"  IEEE 
Trans.  Automat.  Contr.,  vol.  AC-22,  pp.  795-803. 

Desoer,  C.A.  (1969).  "Slowly  varying  system  x = A(t)x  ,"  IEEE  Trans. 
Automat.  Contr.,  vol.  AC-14,  pp.  780-781. 

Desoer,  C.A.  (1970).  "Slowly  varying  discrete  system  x.  ..  = A.  x.  ," 
Electronics  Letters,  vol.  6,  pp.  339-340.  1+  1 1 

De  Souza,  C.E.,  M.R.  Gevers , and  G.C.  Goodwin  (1986).  "Riccati 
equations  in  optimal  filtering  of  nonstabilizable  systems  having 
singular  state  transition  matrices,"  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-31,  pp.  831-838. 


99 


Fagin,  S.L.  (1964).  "Recursive  linear  regression  theory,  optimal 
filtering  theory,  and  error  analyses  of  optimal  systems,"  IEEE  Int. 
Conv.  Rec.,  pp.  216-240. 

Fitzgerald,  R.J.  (1971).  "Divergence  of  the  Kalman  filter,"  IEEE  Trans. 
Automat.  Contr. , vol.  AC-16,  pp.  736-747. 

Friedland,  B.  (1982).  "Estimating  noise  variances  by  using  multiple 
observers,"  IEEE  Trans.  Aerosp.  Electron.  Syst. , Vol.  AES -18,  pp.  442- 
448. 

Fujita,  S.  and  T.  Fukao  (1970).  "Error  analysis  of  optimal  filtering 
algorithm  for  colored  noise,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC- 15, 
pp.  452-455. 

Fuller,  W.A.  (1976).  Introduction  to  Statistical  Time  Series,  John 
Wiley  & Sons,  New  York. 

Griffin,  R.E.  and  A.P.  Sage  (1968).  "Large  and  small  scale  sensitivity 
analysis  of  optimum  estimation  algorithms,"  IEEE  Trans.  Automat. 
Contr.,  vol.  AC-13,  pp.  320-329. 

Griffin,  R.E.  and  A.P.  Sage  (1969).  "Sensitivity  analysis  of  discrete 
filtering  and  smoothing  algorithms,"  AIAA  J.,  vol.  7,  pp.  1890-1897. 

Heffes,  H.  (1966).  "The  effect  of  erroneous  models  on  the  Kalman  filter 
response,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-11,  pp.  541-543. 

Herstein,  I.N.  (1975).  Topics  in  Algebra,  John  Wiley  & Sons,  New  York. 

Hoeffding,  W.  and  H.  Robbins  (1948) . "The  central  limit  theorem  for 
dependent  random  variables,"  Duke  Math.  J.,  vol.  15,  pp.  773-780. 

Huddle,  J.R.  and  D.A.  Wismer  (1968).  "Degradation  of  linear  filter 
performance  due  to  modeling  error,"  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-13,  pp.  421-423. 

Iglehart,  S.C.  and  C.T.  Leondes  (1974).  "Estimation  of  a dispersion 
parameter  in  discrete  Kalman  filtering,"  IEEE  Trans.  Automat.  Contr., 
vol.  AC-19,  pp.  262-263. 

Isermann,  R.  (1984) . "Process  fault  detection  based  on  modeling  and 
estimation  methods  - A survey,"  Automatica,  vol.  20,  pp.  387-404. 

Jazwinski,  A.H.  (1970).  Stochastic  Processes  and  Filtering  Theory, 
Academic  Press,  New  York. 

Kailath,  T.  (1968).  "An  innovations  approach  to  least- squares 
estimation:  Part  I,  Linear  filtering  in  additive  white  noise,"  IEEE 

Trans.  Automat.  Contr.,  vol.  AC-13,  pp.  646-655. 

Kalman,  R.E.  (1960).  "A  new  approach  to  linear  filtering  and  prediction 
problems,"  ASME  J.  Basic  Eng.,  vol.  82,  pp.  35-45. 


100 


Kalman,  R.E.  (1963).  "New  methods  in  Wiener  filtering  theory,"  Proc. 
Symp.  Eng.  Appl.  Random  Function  Theory  and  Probability,  Boydanoff, 
J.L.  and  F.  Kozin  Eds.,  John  Wiley,  New  York. 

Kalman,  R.E.  and  R.S.  Bucy  (1961).  "New  results  in  filtering  and 
prediction  theory,"  ASME  J.  Basic  Eng.,  vol.  83,  pp.  95-108. 

Kato,  T.  (1976).  Perturbation  Theory  for  Linear  Operators,  Springer- 
Verlag,  New  York. 

Koussoulas,  K.T.  and  C.T.  Leondes  (1986).  "On  the  sensitivity  of  a 
discrete- time  Kalman  filter  to  plant-dynamics  modelling  errors,"  Int. 
J.  Syst.  Sci.,  vol.  17,  pp.  937-941. 

Krishnaiah,  P.R.  (1979).  "Some  developments  on  simultaneous  test 

procedures,"  Chapter  4 in  Developments  in  Statistics,  vol.  2,  Academic 
Press,  New  York. 

Krishnaiah,  P.R.  and  T.C.  Chang  (1971).  "On  the  exact  distribution  of 
the  extreme  roots  of  the  Wishart  and  MANOVA  Matrices,"  J.  Multivariate 
Analysis,  vol.  1,  pp.  108-117. 

Kucera,  V.  (1972a).  "A  contribution  to  matrix  quadratic  equations," 
IEEE  Trans.  Automat.  Contr. , vol.  AC-17,  pp.  344-357. 

Kucera,  V.  (1972b).  "The  discrete  Riccati  equation  of  optimal  control," 
Kybernetika,  vol.  8,  pp.  430-447. 

Kucera,  V.  (1973).  "A  review  of  the  matrix  Riccati  equation," 
Kybernetika,  vol.  9,  pp.  42-61. 

Kwakernaak,  H.  and  R.  Sivan  (1972).  Linear  Optimal  Control  Systems, 
Wiley- Interscience , New  York. 

Lainiotis,  D.G.  and  F.L.  Sims  (1970).  "Sensitivity  analysis  of  discrete 
Kalman  filters,"  Int.  J.  Contr.,  vol.  12,  pp.  657-669. 

Lancaster,  H.O.  (1969).  The  Chi-Squared  Distribution,  John  Wiley  & 
Sons,  New  York. 

Lee,  D.P.,  S.E.  Cohn,  A.  Dalcher,  and  M.  Ghil  (1985).  "An  efficient 
algorithm  for  estimating  noise  covariances  in  distributed  systems," 
IEEE  Trans.  Automat.  Contr.,  vol.  AC-30,  pp.  1057-1064. 

Lee,  T.T.  (1980).  "Direct  approach  to  identify  the  noise  covariances  of 
Kalman  filtering,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-25,  pp.  841- 
842. 

Lehmann,  E.L.  (1986).  Testing  Statistical  Hypotheses,  John  Wiley  & 
Sons,  New  York. 

Leininger,  G.G.  (1981).  "Model  degradation  effects  on  sensor  failure 
detection,"  Proc.  1981  Joint  Autmat.  Contr.  Conf . , Charlottesville, 
Virginia. 


101 


Lou,  X.C.,  A.S.  Willsky,  and  G.C.  Verghese  (1986).  "Optimally  robust 
redundancy  relations  for  failure  detection  in  uncertain  systems," 
Automatica,  vol.  22,  pp.  333-344. 

Mehra,  R.K.  (1970).  "On  the  identification  of  variances  and  adaptive 
Kalman  filtering,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-15,  pp.  175- 
184. 

Mehra,  R.K.  (1972).  "Approaches  to  adaptive  filtering,"  IEEE  Trans. 
Automat.  Contr.,  vol.  AC-17,  pp.  693-698. 

Mehra,  R.K.  (1976) . "Optimization  of  measurement  schedules  and  sensor 
designs  for  linear  dynamic  systems,"  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-21,  pp.  55-64. 

Mehra,  R.K.  and  J.  Peschon  (1971).  "An  innovations  approach  to  fault 
detection  and  diagnosis  in  dynamic  system,"  Automatica,  vol.  7.,  pp. 
637-640. 

Molinari,  B.P.  (1973).  "The  stabilizing  solution  of  the  algebraic 
Riccati  equation,"  SIAM  J.  Contr.,  vol.  11,  pp.  262-271. 

Molinari,  B.P.  (1975).  "The  stabilizing  solution  of  the  discrete 
algebraic  Riccati  equation,"  IEEE  Trans.  Automat.  Contr.,  Vol.  AC-20, 
pp.  396-399. 

Mood,  A.M.  , F.A.  Graybill,  and  D.C.  Boes  (1974).  Introduction  to  the 
Theory  of  Statistics,  McGraw-Hill,  New  York. 

Myers,  K.A.  and  B.D.  Tapley  (1976).  "Adaptive  sequential  estimation 
with  unknown  noise  statistics,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC- 
21,  pp.  520-523. 

Neal,  S.R.  (1967).  "Linear  estimation  in  the  presence  of  errors  in 
assumed  plant  dynamics,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-12,  pp. 
592-594. 

Neethling,  C.  and  P.  Young  (1974).  "Comments  on  "Identification  of 
optimum  filter  steady- state  gain  for  systems  with  unknown  noise 
covariances","  IEEE  Trans.  Automat.  Contr.,  vol.  AC -19,  pp.  623-625. 

Nishimura,  T.  (1966).  "On  the  a priori  information  in  sequential 
estimation  problems,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-11,  pp. 
197-204. 

Nishimura,  T.  (1967a).  "Correction  to  an  extension  of  "On  the  a priori 
information  in  sequential  estimation  problems","  IEEE  Trans.  Automat. 
Contr.,  vol.  AC-12,  pp.  123. 

Nishimura,  T.  (1967b).  "Error  bounds  of  continuous  Kalman  filters  and 
the  application  to  orbit  determination  problems,"  IEEE  Trans.  Automat. 
Contr.,  vol.  AC-12,  pp.  268-275. 


102 


Nishimura,  T.  (1970).  "Modeling  errors  in  Kalman  filters,"  in  Theory 
and  Application  of  Kalman  Filtering,  Leondes , C.T.,  Ed.,  AG ARDo graph 
139,  pp.  89-103. 

Nobel,  B.  and  J.W.  Daniel  (1977).  Applied  Linear  Algebra,  Prentice- 
Hall,  Englewood  Cliffs,  New  Jersey. 

Ohnishi,  K.  (1980).  "Direct  recursive  estimation  of  noise  statistics," 
in  Control  and  Dynamic  Systems,  vol.  16,  Leondes,  C.T.,  Ed.,  Academic 
Press,  New  York.,  pp.  249-297. 

Pachter,  M.  and  T.E.  Bullock  (1977).  "Ordering  and  stability  properties 
of  the  Riccati  equation,"  Nat.  Res.  Inst,  for  Math.  Sci.,  Rep.  WISK 
264,  Pretoria,  South  Africa. 

Patel,  R.V.  and  M.  Toda  (1980).  "Bounds  on  performance  of  nonstationary 
continuous- time  filters  under  modelling  uncertainty,"  Automatica,  vol 
20,  pp.  117-120. 

Pearson,  E.S.  and  H.O.  Hartley  (1972).  Biometrika  Tables  for 
Statisticians,  vol.  II,  University  Press,  Cambridge,  England. 

Poubelle , M.  , I.R.  Petersen,  M.R.  Gevers , and  R.R.  Bitmead  (1986).  "A 
miscellany  of  results  on  an  equation  of  Count  J.f.  Riccati,"  IEEE 
Trans.  Automat.  Contr. , vol.  AC-31,  pp.  651-654. 

Price,  C.F.  (1968).  "An  analysis  of  the  divergence  problem  in  the 

Kalman  filter,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC-12,  pp.  699-702. 

Rosenbrock,  H.H.  (1963).  "The  stability  of  linear  time -dependent 
control  systems,"  J.  Electron.  Contr.,  vol.  15,  pp.  73-80. 

Roy,  S.N.  (1953).  "On  a heuristic  method  of  test  construction  and  its 
use  in  multivariate  analysis,"  Ann.  Math.  Stat. , vol.  24,  pp.  220-238. 

Roy,  S.N.  (1957).  Some  Aspect  of  Multivariate  Analysis,  John  Wiley  & 
Sons,  New  York,  1957. 

Rudin,  W.  (1976).  Principles  of  Mathematical  Analysis,  McGraw-Hill,  New 
York. 

Sangsuk-Iam,  S.  and  T.E.  Bullock  (1987a).  "Behavior  of  the  discrete- 
time Kalman  filter  under  incorrect  noise  covariances,"  Proc.  26th 
IEEE  Conf.  on  Decision  and  Control,  Los  Angeles,  California. 

Sangsuk-Iam,  S.  and  T.E.  Bullock  (1987b).  "Analysis  of  continuous  - time 
Kalman  filtering  under  incorrect  noise  covariances,"  Automatica,  to 
be  published. 

Schlee , F.H. , C.J.  Standish,  and  N.F.  Toda  (1967).  "Divergence  in  the 
Kalman  filter,"  AIAA  J.,  vol.  5,  pp.  1114-1120. 

Skoog,  R.A.  and  C.G.Y.  Lau  (1972).  "Instability  of  slowly  varying 
systems,"  IEEE  Trans.  Automat.  Contr.,  vol.  AC -17,  pp.  86-92. 


103 


Tabuchi,  T. , T.  Soeda,  and  T.  Yoshimura  (1978).  "On  the  estimation  of 
noise  covariances  in  linear  discrete -time  systems,"  Int.  J.  Syst. 
Sci. , vol.  9,  pp.  531-542. 

Tajima,  K.  (1978).  "Estimation  of  steady-state  Kalman  filter  gain," 
IEEE  Trans.  Automat.  Contr. , vol.  AC-23,  pp.  944-945. 

Toda,  M.  and  R.V.  Patel  (1978).  "Performance  bounds  for  continuous- time 
filters  in  the  presence  of  modeling  errors,"  IEEE  Trans.  Aerosp . 
Electron.  Syst.,  vol.  AES-14,  pp.  912-918. 

Toda,  M.  and  R.V.  Patel  (1980).  "Bounds  on  estimation  error  of 
discrete- time  filter  under  modeling  uncertainty,"  IEEE  Trans.  Automat. 
Contr.,  vol.  AC-25,  pp.  1115-1121. 

Tse,  E.  and  H.L.  Weinert  (1975).  "Structure  determination  and  parameter 
identification  for  multivariable  stochastic  linear  systems,"  IEEE 
Trans.  Automat.  Contr.,  vol.  AC-20,  pp.  603-613. 

Vidyasagar,  M.  (1978).  Nonlinear  Systems  Analysis.  Prentice -Hall , 

Englewood  Cliffs,  New  Jersey. 

Weiss,  I.M.  (1970).  "A  survey  of  discrete  Kalman-Bucy  filtering  with 
unknown  noise  covariances,"  AIAA  Guidance,  Control  and  Flight 
Mechanics  Conf . , Santa  Barbara,  California. 

Willsky,  A.S.  (1976).  "A  survey  of  design  methods  for  failure  detection 
in  dynamic  systems,"  Automatica,  vol.  12,  pp.  601-611. 

Willsky,  A.S.,  J.J.  Deyst,  and  B.S.  Crawford  (1975).  "Two  self- test 
methods  applied  to  an  inertial  system  problem,"  J.  Spacecraft,  vol  12, 
pp.  434-437. 

Wong,  E.  and  B.  Hajek  (1985).  Stochastic  Processes  in  Engineering 
Systems,  Springer-Verlag,  New  York. 


BIOGRAPHICAL  SKETCH 


Suwanchai  Sangsuk-Iam  was  born  in  Samutsakhon,  Thailand,  in  1960. 
He  received  the  B.E.  degree  with  first  class  honors  from  Chulalongkorn 
University,  Bangkok,  Thailand,  in  1981  and  the  M.E.  degree  from  the 
University  of  Florida,  Gainesville,  in  1983,  all  in  electrical 
engineering. 

Upon  his  graduation  from  Chulalongkorn  University,  he  received  a 
medal  from  King  Pumipol  Adulyadech  in  recognition  of  his  four-year 
academic  performance.  In  1984,  he  was  with  the  Department  of  Electrical 
Engineering  and  Computer  Sciences,  University  of  California  at  Berkeley, 
where  he  was  awarded  the  Nonresident  Tuition  Scholarship  and  the  Eugene 
C.  and  Mona  Fay  Gee  Scholarship  after  having  studied  there  for  a 
semester.  Since  August  1985,  he  has  been  a research  assistant  and  a 
teaching  assistant  in  the  Department  of  Electrical  Engineering, 
University  of  Florida. 

His  present  research  interests  are  in  the  areas  of  control, 
estimation,  failure  detection,  identification,  sensor  fusion,  signal 
processing,  and  time  series. 


104 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it 
conforms  to  acceptable  standards  of  scholarly  presentation  and  is  fully 
adequate,  in  scope  and  quality,  as  a dissertation  for  the  degree  of 
Doctor  of  Philosophy. 


T.E.  Bullock,  Chairman 

Professor  of  Electrical  Engineering 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it 
conforms  to  acceptable  standards  of  scholarly  presentation  and  is  fully 
adequate,  in  scope  and  quality,  as  a dissertation  for  the  degree  of 
Doctor  of  Philosophy. 


il5 


Basil' 


Professor  of  Electrical  Engineering 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it 
conforms  to  acceptable  standards  of  scholarly  presentation  and  is  fully 
adequate,  in  scope  and  quality,  as  a dissertation  for  the  degree  of 
Doctor  of  Philosophy. 


H . A . Latchman 

Assistant  Professor  of  Electrical 
Engineering 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it 
conforms  to  acceptable  standards  of  scholarly  presentation  and  is  fully 
adequate,  in  scope  and  quality,  as  a dissertation  for  the  degree  of 
Doctor  of  Philosophy. 


If 


Z.R.  Pop-Stojanovic 
Professor  of  Mathematics 


I certify  that  I have  read  this  study  and  that  in  my  opinion  it 
conforms  to  acceptable  standards  of  scholarly  presentation  and  is  fully 
adequate,  in  scope  and  quality,  as  a dissertation  for  the  degree  of 
Doctor  of  Philosophy. 


c.f 

M.C.K.  Yang/^ 
Professor  bfj: 


sties 


This  dissertation  was  submitted  to  the  Graduate  Faculty  of  the  College 
of  Engineering  and  to  the  Graduate  School  and  was  accepted  as  partial 
fulfillment  of  the  requirements  for  the  degree  of  Doctor  of  Philosophy. 


December , 


1987 


/ UjU^  Q.  • 

Dean,  College  of  Engineering 


Dean,  Graduate  School 


